PCX 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 



Exhibit 1 

U.S. Patent Application Serial No. 
08/976,560 



INTERNATIONAL AFPUCATIOy PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCQ 



(51) International Patent Classification ^ : 

C07H 17/00, C07K 14/00, AOIN 43/04, 
A61K 3Sa4, C12Q 1/68, C12P 21A)6, 
C12N 15/00 



Al 



(11) IntemaUonai PublicaUon Number: WO 99/47535 

(43) International Publication Date: 23 September 1999 (23.09.99) 



(21) International Application Number: P(nyUS99/05606 

(22) International Filing Date: 16 March 1999 (16.03.99) 



(30) Priority Data: 
60/078,044 
60/088,312 
60/106.056 
09/236,134 



16 March 1998 (16.03.98) US 

5 June 1998 (05.06.98) US 

28 October 1998 (28.10.98) US 

22 January 1999 (22.01.99) US 



(71) Applicants (for all designated States except US): MILLEN- 

NIUM PHARMACEUTICALS, INC. [USAJS]; 238 Main 
^ Street, Cambridge. MA 02142-1017 (US). THE REGENTS 
OF THE UNTVERSITY OF CALIFORNIA [US/USJ; 300 
Lalceside Drive. Oakland, OA 94612 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): CHEN, Hong [CN/US]; 
46 Aspinwal Avenue #2, Brookline, MA 02146 (US). 
FREIMER. Nelson, B. [US/US]; 630 29tfi Street, San 
Francisco, CA 94131 (US). 

(74) Agents: CORU2ZI, Laura, A. et al.; Pennie & Edmonds LLP, 
1 155 Avenue of the Americas, New Yoric. NY 10036 (US). 



(81) Designated States: AE. AL, AM. AT, AU. AZ, BA, BB. BG, 
BR. BY, CA. CH. CN, CU, (2, DE. DK, EE, ES. FI, GB. 
GD, GE. GH. GM, HR, HU, ID, IL, IN, IS. JP. KE. KG 
KP, KR, KZ. LC. LK, Ut, LS. LT, LU, LV, MD. MG, MK. 
MN. MW, MX, NO, NZ, PL. PT, RO, RU, SD, SE, SG, SI 
SK, SL. TJ. TM, TR, TT. UA. UG, US, UZ, VN, YU, ZA. 
ZW, AREPO patent (GH, GM, KE, LS, MW. SD, SL. SZ. 
UG, ZW), Eurasian patent (AM, AZ, BY. KG, KZ, MD, 
RU, TJ. m), European patent (AT, BE, CH. CY. DE, DK, 
ES. H. FR. GB. GR, IE. IT. LU, MC. NL. FT, SE), OAPI 
patent (BF, BJ. CF, CG, Q. CM, GA, GN. GW, ML, MR. 
NE. SN, TD, TG). 



Published 

With international search report 
Before the expiration of the time limit for amending the 
claims and to be republished in the event of the recent of 
amendments. 



(54) TlUe: METHODS AND COMPOSITIONS FOR DIAGNOSING AND TREATING CHROMOSOME- 18p RELATED DISORDERS 




(57) Abstract 

The present invention relates to the mammalian UKNG) gene, a gene associated with bipolar affective disorder (BAD) in humans. 
The invention relates, in particular, to methods for the diagnostic evaluation, genetic testing and prognosis of HKNGI neuropsychiairic 
disorders including schizophrenia, attention deficit disorder, a schizoaffective disorder, a bipolar affective disonier or a unipolar affective 
disorder. 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCX. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


AimenU 


VI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


R-ance 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azeitaijan 


GB 


United Kingdom 


rvic 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TO 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Betgimn 


GN 


Guinea 


MK 


The fomter Yogoshv 


TM 


'T>ir1cmcnistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United Sutes of America 


CA 


Canada 


IT 


baly 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


N£ 


Niger 


VN 


Viet Nam 


CG 


Congo 


KB 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzatan 


NO 


Norway 


ZW 


Zimbabwe 


a 


.JCAte d'lvoiit 


KP 


Democratic People's 


N2 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Inland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






CU 


Cuba 


KZ 


Kazalcstan 


RO 


Romania 






CZ 


Czech Republic 


LC 


Saim Lucia 


RU 


Russian Federation 






DE 


Germany 


U , 


Liechtenuein 


SD 


Sudan 






DK 


Denroaric 


LK- 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR : 


Liberia 


SG 


Singapore 







wo 99/47535 



PCT/US99/05606 



METHODS AND COMPO SITIONS PQR DIAGNOSING AM) TREATING 
CHROMOS0ME-18P REIATED DISORDERS 

5 

This is a continuation-in-part of U.S. application no. 
09/236,134, filed on January 22, 1999 which claims priority 
under 35 U.S.C. § 119(e)(1) of provisional application no. 
60/078,044 filed on March 16, 1998, of provisional 
application no. 60/088,312 filed on June 5, 1998, and of 
10 provisional application no. 60/106,056 filed on October 28, 
1998, each of which is incorporated herein by reference in 
its entirety. 

1. INTRODUCTION 
The present invention relates, first, to the HKNGl gene, 
shown herein to be associated with central nervous system- 
related disorders, e.g. , neuropsychiatric disorders, in 
particular, bipolar affective disorder and schizophrenia and 
with myopia-related disorders. The invention includes 
recombinant DNA molecules and cloning vectors comprising 
sequences of the HKNGl gene, and host cells and non-human 
host organisms engineered to contain such DNA molecules and 
cloning vectors. The present invention further relates to 
HKNGl gene products, euid to zmtibodies directed against such 
HKNGl gene products. The present invention also relates to 
methods of using the HKNGl gene and gene product, including 
S5 drug screening assays, and diagnostic and therapeutic methods 
for the treatment of JEOOTGl -mediated disorders, including 
ITJOTGI-mediated neuropsychiatric disorders such as bipolar 
affective disorder, as well as HJOTGI-inediated myopia 
disorders such as early-onset autosomal dominant myopia. 
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2* BACKGRonwo OP vm TMVENTTnw 

oli„l!;r * P^y^i-tric disorders 1„ which 

llTT °^ disorder can be correlated 

with deMonstrable defects i„ the structure and/or fuhotion of 
^ the nervous syste.. Bell-known exa^ies of such dlsord«" 
include Huntington-s disease, which can he traced to a 
mutation in a single gene and in which neurons in the 

dl"r ^">^^-s disease, in which 

dopaminergic neurons i„ the nigro-strlatal pathway 
degenerate. The vast majority of psychiatric disorders 
however, presumably involve subtle and/or undetectable 
Changes, at the cellular and/or molecular levels, in nervous 
system structure and function, ^is laclc of det:ctable 
neurological defects distinguishes -neuropsychlatric» 
disorders, such as sohiMphrenla, attention deficit 
disorders, schizoaffective disorder, bipolar affective 
« T T^' " a«ective disorder, from neurological 

disorders, in which anatomical or biochemical pathologHr^e 
-nifest. Hence, identification of the causative def^ Z 
the neuropathologies of neuropsychlatric disorders are n^e^- 
in order to enable clinicians to evaluate „d prescribe 
appropriate courses of treatment to cure or ameliorate the 
symptoms of these disorders. 

one Of the most prevalent and potentially devastating of 
n^opsyohiatric disorders is bipolar affective dlsordL ' 
(B*D) , also known as bipolar mood disorder (BP) or manic- 
depressive illness. Which is characterized by episod" of 
elevated mood (mania, and depression (Goodwin, et al., i„« 
„ «anic x,epressive niness, Oxford University Press, Hew York,' 
The most severe and clinically distinctive forms of B«, are ' 
^-I (severe bipolar affective (mood, disorder,, which 

(schizoaffective disorder manic type,. n.ey are 
. Characterized by at least one full episode of mania, with or 
3. without episodes of major depression (defined by lo;err 

or depression with associated disturbances in rhythmic 
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behaviors such as sleeping, eating, and sexual activity) • 
BP-I often co-segregates in families with more etiological ly 
heterogeneous syndromes, such as with a unipolar affective 
disorder such as xuiipolar major depressive disorder (MDD) , 
which is a more broadly defined phenotype (Freimer and Reus, 
^ 1992, in The Molecular and Genetic Basis of Neurological 
Disease, Rosenberg, et al., eds., Butterworths, New York, pp. 
951-965; Mclnnes and Freimer, 1995, Curr. Opin. Genet. 
Develop., 5, 376-381) • BP-I and SAD-M are severe mood 
disorders that are frequently difficult to distinguish from 
one another on a cross-sectional basis, follow similar 
clinical courses, and segregate together in family studies 
(Rosenthal, et al . , 1980, Arch- General Psychiat. 37, 804- 
810; Levinson and Levitt, 1987, Am. J. Psychiat. 144, 415- 

- 426; Goodwin, et aJ., 1990, Manic Depressive Illness, Oxford 
University Press, llew York). Hence, methods for 

15 distinguishing neuropsychiatric disorders such as these are 
needed in order to effectively diagnose and treat afflicted 
individuals. 

Currently, individuals are typically evaluated for BAD 
using the criteria set forth in the most ctirrent version of 
the American Psychiatric Association's Diagnostic and 

20 Statistical Manual of Mental Disorders (DSM) . While memy 
drugs have been used to treat individuals diagnosed with BAD, 
including lithium salts, carbamazepine and valproic acid, 
none of the currently available drugs are adequate. For 
example, dmig treatments are effective in only approximately 
60-70% of individuals diagnosed with BP-I. Moreover, it is 

25 cxirrently impossible to predict which drug treatments will be 
effective in, for example, particular BP-I affected 
individuals* Commonly, upon diagnosis, affected individuals 
are prescribed one drug after another until one is found to 
be effective. Early prescription of an effective drug 
treatment, therefore, is critical for several reasons, 

30 including the avoidance of extremely dangerous manic 

episodes, the risk of progressive deterioration if effective 
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treatments are not found, and the risk of substantial side 
effects of current treatments. 

The existence of a genetic component for BAD is strongly 
supported by segregation analyses and twin studies 
(Bertelscn, et al,, 1977, Br. J. Psychiat. 130, 330-351; 
Freimer and Reus, 1992, in Th& Molecular and GBjistic Basis of 
Neurological Disease, Rosenberg, et al., eds., Butterworths , 
New York, pp. 951-965; Pauls, et al., 1992, Arch. Gen. 
Psychiat. 49, 703-708). Efforts to identify the chromosomal 
location of genes that might be involved in BP-I, however, 
10 have yielded disappointing results in that reports of linkage 
between BP-I and markers on chromosomes X and li could not be 
independently replicated nor confirmed in the re-analyses of 
the original pedigrees, indicating that with BAD linkage 
studies, even extremely high lod scores at a single locus, 
can be false positives (Baron, et al., 1987, Nature 326, 289- 
"292; Egeland, et al., 1987, Nature 325, 783-787; Kelsoe, et 
al., 1989, Nature 342, 238-243; Baron, et al,, 1993, Nature 
GSenet. 3, 49-55). 

Recent investigations have suggested possible 
localization of BAD genes on chromosomes 18p and 2lq, but in 
both cases the proposed candidate region is not well defined 
and no unequivocal support exists for either location 
(Berrettini, et al., 1994, Proc. Natl. Acad. Sci. USA 91, 

5918-5921; Murray, et al., 1994, Science 265, 2049-2054; 

Pauls, et al., 1995, Am. J. Hum. Genet. 57, 636-643; Maier, 
et al., 1995, Psych. Res. 59, 7-15; Straub, et al., 1994, 
Nature Genet. 8, 291-296). 

Mapping genes for common diseases believed to be caused 
by multiple genes, such as BAD, may be complicated by the 
typically imprecise definition of phenotypes, by etiologic 
heterogeneity, and by uncertainty about the mode of genetic 
30 ^^^"^"'^ssion of the disease trait. With neuropsychiatric 
disorders there is even greater ambiguity in distinguishing 
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individuals who likely carry an affected genotype from those 
who are genetically unaffected. For excunple, one can define 
an affected phenotype for BAD by including on or more of the 
broad grouping of diagnostic classifications that constitute 
the mood disorders: BP-I, SAD-M, MDD, and bipolar affective 
5 (mood) disorder with hypomania and major depression (BP-II) . 
Thus, one of the greatest difficulties facing 
psychiatric geneticists is uncertainty regarding the validity 
of phenotype designations, since clinical diagnoses are based 
solely on clinical observation and subjective reports. Also, 
with complex traits such as neuropsyehiatric disorders, it is 
^® difficult to genetically map the trait-causing genes because: 
(1) nexiropsychiatric disorder phenotypes do not exhibit 
classic Hendelian recessive or dominant inheritance patterns 
attributable to a single genetic locus, (2) there may be 
incomplete penetrance, i.e., individuals who inherit a 

predisposing allele may not manifest disease; (3) a phenocopy 
phenomenon may occur, i.e., individuals who do not inherit a 
predisposing allele may nevertheless develop disease due to 
environmental or random causes; (4) genetic heterogeneity may 
exist, in which case mutations in any one of several genes 
may result in identical phenotypes. 
2^ Despite these difficulties, however, identification of 

the chromosomal location, sequence and function of genes and 
gene products responsible for causing neuropsyehiatric 
disorders such as bipolar, affective disorders is of great . 
importance for genetic counseling, diagnosis and treatment of 
individuals in affected families. 

25 

3. SUMMARY OF THE INVENTION 
The present invention relates, first, to the discovery, 
identification, and characterization of novel nucleic acid 
molecules that are associated with central nervous system- 
related disorders and processes, e.g., hiunan neuropsyehiatric 

3Q disorders, such as schizophrenia, attention deficit disorder, 
schizoaffective disorder, dysthymic disorder, major 

- 5 - 



wo 99/47535 



PCT/US99/05606 



depressive disorder, and bipolar affective disorder (BAD) 
including severe bipolar affective (B.ood) disorder (BP-I) 
bxpolar affective (.ood, disorder with hypo.ania and .ajo^ 
depression (BP-li) . The invention further relates to the 
discovery, identification, and characterization of proteins 
encoded by such nucleic acid molecules, or by degenerate " 
e.y., allelic or homologous, variants thereof. The invention 
further relates to the discovery, identification, and 
characterization of novel nucleic acid molecules that are 
associated with human myopia or nearsightedness, such as 
early-onset, autosomal dominant myopia, as well as to the 
discovery, identification, and characterization of proteins 
encoded by such nucleic acid molecules or by degenerate 
variants thereof. 

in particular, the nucleic acid molecules of the present 
invention represent, first, nucleic acid sequences 
corresponding to the gene referred to herein as HKNGl, As 
d^onstrated in the Examples presented below in Sections 6, 8 
and 14, the BKNGl gene is associated with human CNS-related 
disorders, e.y., neuropsychiatric disorders, in particular 
BAD. The HKNGl gene is associated with other human 
neuropsychiatric disorders as well, such as schizophrenia. 
AS demonstrated in the Example presented below in Section- 14 
the HENGl gene is also associated with human myopia, such as' 
early-onset autosomal dominant myopia. 

in addition to the positive correlation between 
mutations within the EKNGl gene and individuals exhibiting 
„ symptoms of BAD, described in Section 6 and 8, the present 
invention is further based, in part, on Applicants' discovery 
Of novel HKNGl cDNA sequences. Applicants' discovery of such 
CDNA sequences has led to the elucidation of the HKNGl 
genomic (that is, upstream untranslated, intron/exon, and 
30 ""^--^-^-<^) structure, and to the discovery of 

30 full-length and alternately spliced HKNGl variants and the 
polypeptides encoded by such variants. These discoveries are 
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described in Sections 7 and 10, below. Applicants' discovery 
of such cDNA sequences has also led to the elucidation of 
novel mammalian (e.g., guinea pig and bovine) HKNGl 
sequences, and to the discovery of novel allelic variants and 
^ polymorphisms of such sequences. These discoveries are 
described in Section 10 below. 

The invention encompasses nucleic acid molecules which 
comprise the following nucleotide sequences: (a) nucleotide 
sequences (e.g., SEQ ID NOS: 1, 3, 5, 6. 36, and 37) that 
comprise a human HKNGl gene and/or encode a human HKNGl gene 
10 product (e.g., SEQ ID NO: 2; SEQ ID NO: 4), as well as 

allelic variants, homologs and orthologs thereof, including 
nucleotide sequences (e.g., SEQ ID NOS:38, 40, 42, 44, and 
46-48) that encode non-human HKNGl gene products (e.g., SEQ 
ID NOS: 39, 41, 43, 45, and 49); (b) nucleotide sequences 
comprising the novel HKNGl sequences disclosed herein that 
encode mutants of the HKNGl gene product in which sequences 
encoding all or a part of one or more of the HKNGl domains is 
deleted or altered, or fragments thereof; (c) nucleotide 
sequences that encode fusion proteins comprising a HKNGl gene 
product (e.g., SEQ ID NO: 2; SEQ ID NO: 4), or a portion 
thereof fused to a heterologous polypeptide; and (d) 
nucleotide sequences within the HKNGl gene, as well as 
chromosome I8p nucleotide sequences flanking the HKNGl gene, 
which can be utilized, e.g., as primers, in the methods of 
the invention for identifying and diagnosing individuals at 
25 risk for or exhibiting an HKWGl-mediated disorder, such as 
BAD or schizophrenia, or for diagnosing individuals at risk 
for or exhibiting a form of myopia such as early-onset 
autosomal dominant myopia. The nucleic acid molecules of (a) 
through (d) , above, can include, but are not limited to, 
CDNA, genomic DNA, and RNA sequences. 



20 



30 
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The invention also encompasses the expression products 
Of the nucleic acid molecules listed above; i.e., peptides 
proteins, glycoproteins and/or polypeptides that are encoded 
by the above HKNGl nucleic acid molecules. 
5 The compositions of the present invention further 

encompass agonists and antagonists of the BKNGl gene product 
xncluding small molecules (such as small organic molecules) ' 
and macromolecules (including antibodies), as well as 
nucleotide sequences that can be used to inhibit EKNGl gene 
expression (e.g., antisense and ribozyme molecules, and gene 

rLL'^''"""^ replacement constructs) or to enhance 

EKNGl gene expression (e.g., expression constructs that place 
the EKNGl gene under the control of a strong promoter 
system) . 

The compositions of the present invention further 

" nu!^' """^""^ expression vectors containing the 

nucleic acad molecules of the invention, as well as hosts 
whaph have been transformed with such nucleic acid molecules, 
xncludang cells genetically engineered to contain the nuclei^ 
acid molecules of the invention, and/or cells genetically 
engineered to express the nucleic acid molecules of the 
2, invention, m addition to host cells and cell lines, hosts 
also include transgenic non-human animals (or progeny 
thereof), particularly non-human mammals, that have been 
engineered to express an EKNGl. transgene, or "knock-outs" 
that have been engineered to not express HKNGl, 

Transgenic non-human animals of the invention include 
25 anxmals engineered to express an HKNGl transgene at higher or 

^II/Tl"^" "'^"-^^^ transgenic 

anxmals of the invention also include animals engineered to 
express a mutant variant or polymorphism of an HKNGl 
transgene which is associated with ffiO^Gi-mediated disorder, 
30 for example an FiCWGl -mediated neuropsychiatric disorders 
such as BAD and schizophrenia, or, alternatively, a myopL 
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5 



disorder such as early-onset autosomal dominant myopia. The 
transgenic animals of the invention further include the 
progeny of such genetically engineered animals. 

The invention further relates to methods for the 
treatment of fflO^Gl -mediated disorders in a subject, such as 
HJOTGl-mediated neuropsychiatric disorders and ffJTWGl -mediated 
myopia disorders, wherein such methods comprise administering 
a compound which modulates the expression of a HKNGl gene 
and/or the synthesis or activity of a HKNGl gene product so 
symptoms of the disorder are ameliorated. 
10 The invention further relates to methods for the 

treatment of fflWGI-mediated disorders in a subject, such as 
fflWGl-mediated neuropsychiatric disorders and HJWGl -mediated 
myopia disorders, resulting from HKNGl gene mutations or 
aberrant levels of HKNGl expression or activity, wherein such 
IS methods comprise supplying the subject with a nucleic acid 
molecule encoding an unimpaired HKNGl gene product such that 
an unimpaired HKNGl gene product is expressed and symptoms of 
the disorder are ameliorated. 

The invention further relates to methods for the 
treatment of fflWGI-mediated disorders in a subject, such as 
HKWGl-mediated neuropsychiatiric disorders and ifJOTGl-mediated 
myopia disorders, resulting from BKNGl gene mutations or from 
aberrant levels of expression or activity, wherein such 
methods comprise supplying the subject with a cell comprising 
a nucleic acid molecule that encodes an unimpaired HKNGl gene 
25 product such that the cell expresses the unimpaired HKNGl 
gene product and symptoms of the disorder are ameliorated. 

The invention also encompasses pharmaceutical 
formulations and methods for treating FJOTGl -mediated 
disorders, including neuropsychiatric disorders, such as BAD 
and schizophrenia, and myopia disorders, such as early-onset 
autosomal dominant myopia, involving HKNGl genfe. 



20 
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In addition, the present invention is directed to 
methods that utilize the HKNGl nucleic acid sequences. 
Chromosome I8p nucleotide sequences flanking the HKNGl human 
gene and/or HKNGl gene product sequences for mapping the 
5 chromosome 18p region, and for the diagnostic evaluation 
genetic testing and prognosis of a if JTWGI -mediated disorder, 
such as a ffJTNGI -mediated neuropsychiatric disorder or a 
HKWGI-mediated myopia disorder. For example, in one 
embodiment, the invention relates to methods for diagnosing 
fflOVGl-aediated disorders, wherein such methods comprise 
measuring HKNGl gene expression in a patient sample, or 
detecting a HKNGl polymorphism or mutation in the genome of a 
mammal, including a human, suspected of exhibiting such a 
disorder, in one embodiment, nucleic acid molecules encoding 
HKNGl can be used as diagnostic hybridization probes or as 
« primers for diagnostic PCR analysis for the identification of 
HKNGl gene mutations, allelic variations and regulatory 
defects in the HKNGl gene which correlate with 
neuropsychiatric disorders such as BAD or schizophrenia. 
The invention still further relates to methods for 
20 identifying compounds which modulate the expression of the 
HKNGl gene and/or the synthesis or activity of the HKNGl gene 
products, including therapeutic compounds, which reduce or " 
^-^^^^^ the symptoms of ffiOTGl-mediated disorders, including^ 
IfJOfGI-mediated neuropsychiatric disorders such as BAD and 
schizophrenia. In particular, cellular and non-cellular 
assays are described that can be used to identify compounds 
that interact with the HKNGl. gene product, e^, modulate the 
activity of the HKNGl and/or bind to the HKNGl gene product. 
Such cell-based assays of the invention utilize cells, cell 
_ lines, or engineered cells or cell lines that express the 
SKNGl gene product. 
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in one embodiment, such methods comprise contacting a 
compound to a cell that expresses a HKNGl gene, measuring the 
level of HKNGl gene expression, gene product expression or 
gene product activity, and comparing this level to the level 
5 of HKNGl gene expression, gene product expression or gene 
product activity produced by the cell in the absence of the 
compound, such that if the level obtained in the presence of 
the compound differs from that obtained in its absence, a 
compound that modulates the expression of the HKNGl gene 
and/or the synthesis or activity of the HKNGl gene products 
^° has been identified. 

in another embodiment, such methods comprise contacting 
a compound to a cell that expresses a HKNGl gene and also 
comprises a reporter construct whose transcription is 
dependent, at least in part, on HKNGl expression or activity. 
^5 in such an embodiment, the level of reporter transcription is 
measured and compared to the level of reporter transcription 
xn the cell in the absence of the compound. If the level of 
reporter transcription obtained in the presence of the 
coinpound differs from that obtained in its absence, a 
compound that modulates expression of HKNGl or genes involved 
20 in fflO^Gl-related pathways or signal transduction has been 
identified. 

In yet another embodiment, such methods comprise 
administering a compound -to a host, such as a transgenic " 
animal, that expresses an HKNGl transgene or a mutant HKNGl 
transgene associated with an HJWGl-mediated disorder such as 
a neuropsychiatric disorder (e.g., BAD or schizophrenia), or 
to an animal, e.g., a knock-out animal, that does not express 
HKNGl, and measuring the level of HKNGl gene expression, gene 
product expression, gene product activity, or symptoms of an 
5Ja7G2-mediated disorder such as an HJWGl -mediated 
neuropsychxatric disorder (e.g., BAD or schizophrenia). The 
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measured level is compared to the level obtained in a host 
that is not exposed to the compound, such that if the level 
obtained when the host is exposed to the compound differs 
from that obtained in a host not exposed to the compound, a 
^ compound modulates the expression of the mammalian HKNGl gene 
and/or the synthesis or activity of the mammalian HKNGl gene 
products, and/or the symptoms of an HJWGI -mediated disorder 
such as a neuropsychiatric disorder (e.g., BAD or 
schizophrenia), has been identified. 

The present invention still further relates to 
10 pharmacogenomic and pharmacogenetic methods for selecting an 
effective drug to administer to an individual having a EKNGl- 
mediated disorder. Such methods are based on the detection 
of genetic polymorphisms in the HKNGl gene or variations in 
HKNGl gene expression due to, e.g., altered methylation, 
^5 differential splicing, or post-translational modification of 
the HKNGl gene product which can affect the safety and 
efficacy of a therapeutic agent. 

As briefly discussed above, the present invention is 
based, in part, on the genetic and physical mapping of the 
HKNGl gene to a specific portion of the short arm of human 
20 chromosome 18 that is associated with human neuropsychiatric 
disorders, in particular, bipolar affective disorder. These 
results are described in the Example presented, below, in 
section 6. - The-invfention is also based on the elucidation of ' 
the EKGNl nucleotide sequence, amino acid sequence and 
expression pattern, as described in the Example presented, 
below, in Section 7. The invention is further based on the 
identification of specific mutations and/or polymorphisms 
within the HKNGl gene which positively correlate with 
neuropsychiatric disorders, in particular, BAD, as described 
in the Example presented below in Section 8. These mutations 
include a point mutation discovered in an individual affected 
by BAD which is absent from the corresponding wild-type 
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nucleic acid derived from non-affected individuals and 
linkage disequilibrium of three markers showing cosegregation 
with a population of individuals with BAD. This mutation is 
a single base mutation which results in a mutant HKNGl gene 
^ product ccnprising substitution of a lysine residue for the 
wild-type glutamic acid residue at HKNGl amino acid position 
202 of the polypeptide of SEQ ID NO: 2 or the HKNGl amino acid 
residue 184 of the polypeptide of SEQ ID N0:4. These 
mutations further include the mutations discovered in 
schizophrenic and BAD patients that are detailed in FIGS. 5A- 
10 SB. 

3.1. DEFINITIONS 

As used herein, the following terms shall have the 
abbreviations indicated. 

^5 BAG, bacterial artificial chromosomes 

BAD, bipolar affective disorder (s) 
BP, bipolar mood disorder 

BP-I, severe bipolar affective (mood) disorder 
BP-II, bipolar affective (mood) disorder with 
hypomsmia and major depression 
20 bp, base pair(s) 

EST, expressed sequence tag 
HKNGl, Hong Kong new gene 1 
lod, logarithm of odds 

HDD, unipolar major depressive disorder 

ROS, reactive oxygen species 
25 RT-PCR, reverse treuiscriptase PGR 

SSCP, single-stranded conformational polymorphism 

SAD-M, schizoaffective disorder manic type 

STS, sequence tagged site 

YAC, yeast artificial chromosome 
"^rXNGI -mediated disorders" include disorders involving 
^® an aberrant level of HKNGl gene expression, gene product 
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synthesis and/or gene product activity relative to levels 
found in Clinically normal individuals, and/or relative to 
levels found in a population whose level represents a 
baseline, average HKNGl level. While not wishing to be bound 
by any particular mechanism, it is to be understood that 
disorder symptoms can, for example, be caused, either 
directly or indirectly, by such aberrant levels. 
Alternatively, it is to be understood that such aberrant 
levels can, either directly or indirectly, ameliorate 
disorder symptoms, (e.g., as in instances wherein aberrant 
levels of SKNGl suppress the disorder symptoms caused by 
mutations within a second gene). 

flWJTGl-mediated disorders include, for example, central 
nervous system (CNS) disorders. CNS disorders include, but 
are not limited to cognitive and neurodegenerative disorders 
such as Alzheimer's disease, senile dementia, Huntington's 
X5 disease, amyotrophic lateral sclerosis, and Parkinson's 
disease, as well as Gilles de la Tourette's syndrome, 
autonomic function disorders such as hypertension and sleep 
disorders, and neuropsychiatric disorders that include, but 
are not limited to schizophrenia, schizoaffective disorder 
attention deficit disorder, dysthymic disorder, major 
20 depressive disorder, mania, obsessive-compulsive disorder, 
psychoactive substance use disorders, anxiety, panic 
disorder, as well as bipolar affective disorder, e.g., severe 
bipolar affective (mood) disorder (BP-I) , bipolar affective 
(mood) disorder with hypomania and major depression (BP-II) 
Further CNS-related disorders include, for example, those 
25 listed in the American Psychiatric Association's Diagnostic 
and Statistical manual of Mental Disorders (DSM) , the most 
current version of which is incorporated herein by reference 
in its entirety. 

"flJOTGi-mediated processes- include processes dependent 
^-^ and/or responsive, either directly or indirectly, to levels 
of HKNGl gene expression, gene product synthesis and/or gene 



- 14 - 



^O^'^^'S^^ • PCT/US99/05606 



product activity, such processes can include, but are not 
limited to, developmental, cognitive and autonomic neural and 
neurological processes, such as, for example, pain, appetite, 
long term memory and short term memory. 

5 

^' BRIEF PER CRIPTTOW HP THE PTgpREB 

FIG. lA-iB. The nucleotide sequence of human HKNGl 
CDNA (SEQ ID no: 1) is depicted on the bottom line. The top 
line depicts the full length amino acid sequence of human 
HKNGl polypeptide (SEQ ID KO: 2) encoded by the human HKNGl 
10 CDNA sequence. The nucleotide sequence encoding SEQ id N0:2 
corresponds to SEQ ID NO: 5. 

FIG. 2A-2B. Nucleotide sequence of an alternately 
spliced human HKNGl variant, referred to as HKNGl-Vl (SEQ ID 
no: 3) is depicted on the bottom line. The derived amino 
acid sequence of the human HKNGl gene product (SEQ id NO: 4) 
" encoded by this alternately spliced cDNA variant is depicted 
on the top line. The nucleotide sequence encoding SEQ ID 
NO: 4 corresponds to SEQ ID NO: 6 

PIG. 3A-3R. Genomic sequence of the human HKNGl gene 
(SEQ ID NO. 7). Exons are in bold and the 3' and 5' DTRs 
(untranslated regions) are underlined. 

PIG. 4. summary of in situ hybridization analysis of 
HKNGl mRNA distribution in normal human brain tissue. 

FIGS. 5A-B. HKNGl polymorphisms relative to tha HKNGl 

wild-type sequence. These polymorphisms were isolated from a 
collection of schizophrenic patients of mixed ethnicity from 
25 the united States (pig. 5A) and from the San Francisco BAD 
collection (fig. 5B) . 

FIGS. 6A-B. Nucleotide sequence of the RT-PCR products 
for HKNGI-V2 (FIG. 6A; SEQ ID NO: 36) and HKNG1-V3 (FIG. 6B; 
SEQ ID NO: 37) . 

FIG 7. The CDNA sequence (SEQ ID NO: 38) and the 
30 predicted amino acid sequence (SEQ id N0:39) of the guinea 
pig HKNGl ortholog gphJaigiBlS. 
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PIG. 8. The cDNA sequence (SEQ ID NO: 40) and the 
predicted amino acid sequence (SEQ ID NO: 41) of gpbkng 7b, an 
allelic variant of the guinea pig EI^SGl ortholog gphknglBlS, 
FIG. 9. The CDNA sequence (SEQ ID NO: 42) and the 
5 predicted amino acid sequence (SSQ ID NO: 43) of gphJzng 7c, an 
allelic variant of the guinea pig SKNGl ortholog gphkngiaiB, 

FIG, 10. The CDNA sequence (SEQ ID NO:44)and the 
predicted amino acid sequence (SEQ ID NO: 45) of gpbkng 7d. an 
allelic variant of the guinea pig HKmi ortholog gphknglBls, 

FIG. 11. The CDNA sequence (SEQ ID NO: 46) and the 
predicted amino acid sequence (SEQ ID NO:49) of the allelic 
variant bbkngl of the bovine HKNGl ortholog. 

FIG. 12. The CDNA sequence (SEQ ID NO: 47) and the 
predicted amino acid sequence (SEQ ID NO: 49) of the allelic 
variant bbkng2 of the bovine HKNGl homolgue. 
15 FIG. 13. The CDNA sequence (SEQ ID NOi48) and the 

predicted amino acid sequence (SEQ id N0:49) of the allelic 
variant bbkxiga of the bovine HKNGl homolgue. 

FIG. 14A-B. Alignments of the guinea pig HKNGl cDNA 
(FIG. 14A) and predicted amino acid (FIG. 14B) sequences for 
gphknglBls, gphkng 7b, gphkng7c, and gphkng 7d. 

FIG. 15. Alignments of the cDNA sequences of the bovine 
HKNGl allelic variants bbkngl, bbkng2. and bbkng3. 

FIG. 16. Alignments of the human (hkng__aa) , bovine 
(bhkngi_aa) and guinea pig (gphkngl8l5_aa) HKNGl amino acid 
sequences. 

FIG. 17. Alignments of the human HKNGl protein 
sequences; top line: the mature secreted HKNGl protein 
sequence (SEQ ID N0:51); second line: immature HKNGl protein 
form 1 (iPFi; SEQ ID N0:2); third line: immature HKNGl 
protein form 2 (IPF2; SEQ ID N0:64); bottom line: immature 
HKNGl protein form 3 (IPr3; SEQ id NO: 4). 

FIG. 18. The nucleotide sequence of human HKNGl splice 
variant HKNG1A7 cDNA (SEQ ID NO: 65) is depicted on the 
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15 



bottom line. The top line depicts the full length amino acid 
sequence of human HKNG1A7 polypeptide (SEQ ID NO: 66) encoded 
by the human HKNGl A7 cDNA sequence. 

5 ^' PBTAILED DESCRTWTn y pp THE TNVENTTOM 

5«1« THE HKNm mrwT? 
HKNGl nucleic acid molecules are described in the 
section, unless otherwise stated , the term "HKNGl nucleic 
acid- refers collectively to the sequences described herein. 
A human HKNGl cDNA sequence (SEQ ID NO: 1) encoding the 
10 full length amino acid sequence (SEQ ID NO: 2) of the HKNGl 
polypeptide is shown in FIG. ia-ib. The human HKNGl gene 
encodes a secreted polypeptide of 495 amino acid residues, as 
shown in FIG. lA-iB, and seq id no: 2. The nucleotide 
sequence of the portion of the cDNA corresponding to the 
coding sequence for HKNGl (SEQ id N0:2) is depicted as SEQ ID 
NO: 5. 

The HKNGl sequences of the invention also include splice 
variants of the HKNGl sequences described herein. For 
example, an alternately spliced human HKNGl cDNA sequence, 
referred to as HKNGl-Vl (SEQ ID NO: 3) encoding a human H^Gl 
variant gene product (i.e., the HKNGl-Vl gene product) is 
shown in FIG. 2A-2B. This splice variant of a human HKNGl 
gene encodes a secreted polypeptide of 477 amino acid 
residues, as shown in PIG. 2A-2B, and SEQ ID NO:4. The 
nucleotide sequence of the portion of the cDNA corresponding 
25 to the coding sequence for HKNGl (SEQ ID N0:4) is depicted in 
SEQ ID NO: 6. 

Another alternately spliced human HKNGl cDNA sequence 
(SEQ ID NO: 65), referred to as HKNG1A7, encodes a second " 
HKNGl variant gene product (the HKNGia? gene product) which 
30 is depicted in FIG. 18. This splice variant of the human 



20 
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SKNGi gene encodes the variant polypeptide shown in FIG. 18 
(SEQ ID MO: 66). 

The genomic structure of the human BKNGl gene has been 
elucidated and is depicted in FIG. 3A-3R, with the HKNGl 
5 exons indicated in bold type, and the 5'- and 3 • -untranslated 
regions indicated by underlining. The wild-type genomic 
sequence of the HKNGl gene is depicted in FIG. 3A-3R and SEQ 
ID NO: 7. 

Non-human homologues or orthologs mammalian orthologs, 
e.^, of the human HKNGl sequences discussed above are also 
10 provided. Specifically, a guinea pig cDNA sequence (SEQ ID 
NO: 38), referred to herein as gphJcngl 815, encoding the full 
length amino acid sequence (SEQ ID NO: 39) of a guinea pig 
HKNGl ortholog is shown in FIG. 7. This guinea pig cDNA 
sequence encodes a gene product of 466 amino acid residues, 
as shown in FIG. 7 and in SEQ ID NO: 39. 

Allelic variants of this guinea pig HKNGl ortholog, 
referred to as gphkng 7b, gphJaig 7c, and gphkng 7d (SEQ ID 
NOS:40, 42, and 44, respectively), are shown in FIGS. 8-10, 
respectively. The allelic variants gphkng7b, gpbkng7c, and 
gpli]aig7d each encode variants of the guinea pig gpbknglBlS 
20 HKNGl gene product which contain deletions of 16, 92, and 93 
amino acids, respectively, as shown in FIGS. 8-10, in SEQ ID 
NOS:41, 43, and 45, respectively, and in the sequence 
alignment in FIG. 14B. 

Bovine HKNGl ortholog cDNA sequences (SEQ ID NOS: 46- 
48), referred to herein as bbkngl, bhkng2, and bbkngS, and 
each encoding the same bovine ortholog gene product are shown 
in FIGS. 11-13, respectively. The bovine HKNGl allelic 
variants encode the same gene product, i.e., a 465 amino acid 
protein, as shown in FIGS. 11-13 and in SEQ ID NO: 49. 

The HKNGl gene nucleic acid molecules of the invention 

30 . 

include: (a) nucleotide sequences and fragments thereof 
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(e^, SEQ ID Nos: 1, 3, 5, 6, 7, 36, 37, and 65) that encode 
a HXNGl gene product (e.g., sEQ ID NOS: 2, 4 and 66), as well 
as homologues, orthologs and allelic variants of such 
sequences and fragments thereof (e.g., SEQ ID NOS:38, 40, 42, 
5 44, and 46-48) which encode homologue or ortholog HKNGl gene 
products (e.g., SEQ ID NOS:39, 41, 43, 45, and 49); (b) 
nucleotide sequences that encode one or more functional 
domains of a HKNGl gene product including, but not limited 
to, nucleic acid sequences that encode a signal sequence 
domain, or one or more clusterin domains as described in 
10 section 5.2 below; (c) nucleotide sequences that comprise 
HKNGl gene sequences of upstream untranslated regions, 
intronic regions, and/or downstream untranslated regions or 
fragments thereof of the HKNGl nucleotide sequences in (a) 
above; (d) nucleotide sequences comprising the novel HKNGl 
15 sequences disclosed herein that encode mutants of the HKNGl 
gene product in which all or a part of one or more of the 
domains is deleted or altered, as well as fragments thereof; 
(e) nucleotide sequences that encode fusion proteins 
comprising a HKNGl gene product (e.g., SEQ ID NO; 2, 4, 39, 
41, 43, 45, 49 and 65), or a portion thereof fused to a 
20 heterologous polypeptide; and (f) nucleotide sequences (e^g., 
primers) within the HKNGl gene, and chromosome 18p nucleotide 
sequences flanking the HKNGl gene which can be utilized as 
part of the methods of the invention for identifying and 
diagnosing individuals at risk for or exhibiting an HKNGl- 
25 mediated disorder, such as BAD, or myopia. 

The HKNGl nucleotide sequences of the invention further 
include nucleotide sequences corresponding to the nucleotide 
sequences of (a) -(f) above wherein one or more of the exons, 
or fragments thereof, have been deleted. In one preferred ' 
- embodiment, the HKNGl nucleotide sequence of the invention is 

«9 V 
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a sequence wherein the exon corresponding to exon 7 of SEQ id 
N0:7, or a fragment thereof, has been deleted. 

The HKNGl nucleotide sequences of the invention also 
include nucleotide sequences that have at least 65%, 70% 
^ 75% 80%, 85%, 90%, 95%, 98%, or more nucleotide sequence 
xdentxty to the HKNGl nucleotide sequences of (a)-(f, above 
The HKNGl nucleotide sequences of the invention further 
include nucleotide sequences that encode polypeptides having 
at least 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, or higher 
amxno acid sequence identity to the polypeptides encoded by 
10 the HKKGl nucleotide sequences of (a)-(f,, e^, sEQ ID NOS: 
2, 4, 39, 41, 43, 45, 49, and 66 above. 

TO determine the percent identity of two amino acid 
sequences or of two nucleic acids, the sequences are aligned 
for optxmal comparison purposes (e.g., gaps can be introduced 
xn the sequence of a first amino acid or nucleic acid 
sequence for optimal alignment with a second amino or nucleic 
acxd sequence) . The amino acid residues or nucleotides at 
corresponding amino acid positions or nucleotide positions 
are then compared. When a position in the first sequence is 
occupxed by the same amino acid residue or nucleotide as the 
corresponding position in the second sequence, then the 
molecules are identical at that position. The percent 
identity between the two sequences is a function of the 
number Of identical positions shared by the sequences (i.e., 
% identxty = # of identical overlapping positions/total # of 
posxtxons X 100%). in one embodiment, the two sequences are 
tine same length. 

" The determination of percent identity between two 

sequences can also be accomplished using a mathematical 
algorithm, a preferred, non-limiting example of a 
mathematical algorithm utilized for the comparison of two 
sequences is the algorithm of Karlin and Altschul (1990) 

.- Proc. Natl. Acad. Sci. OSA 87:2264-2268, modified as in 
Karlin and Altschul (1993)Proc. Natl. Acad. Sci. nsA 
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90:5873-5877. Such an algorithm is incorporated into the 
NBIAST and XBLAST programs of Altschul, et al. (1990) J, Mol, 
Biol. 215:403-410. BLAST nucleotide searches can be 
performed with the NBLAST program, score = lOO, wordlength = 
S 12 to obtain nucleotide sequences homologous to a nucleic 
acid molecules of the invention. BLAST protein searches can 
be performed with the XBLAST program, score = 50, wordlength 
= 3 to obtain amino acid sequences homologous to a protein 
molecules of the invention. To obtain gapped alignments for 
comparison purposes. Gapped BLAST can be utilized as 
10 described in Altschul et al. (1997) Nucleic Acids 

J?es.25:3389-3402. Alternatively, PSI-Blast can be used to 
perform an iterated search which detects distant 
relationships between molecules {Id.). When utilizing BLAST, 
Gapped BLAST, and PSI-Blast programs, the default parameters 
of the respective programs (e.g., XBLAST and NBLAST) can be 
used (see http://www.ncbi.nlm.nih.gov). Another preferred, 
non-limiting example of a mathematical algorithm utilized for 
the comparison of sequences is the algorithm of Myers and 
Miller, (1988) CASJOS 4:11-17. Such an algorithm is 
incorporated into the ALIGN program (version 2.0) which is 
20 part of the GCG sequence alignment software package. When 
utilizing the ALIGN program for comparing amino acid 
sequences, a PAM120 weight residue table, a gap length 
penalty of 12, and a gap penalty of 4 can be used. 

The percent identity between two sequences can be 
determined using techniques similar to those described above, 
25 with or without allowing gaps, m calculating percent 
identity, typically only exact matches are counted. 

The EKNGl nucleotide sequences of the invention further 
include: (a) any nucleotide sequence that hybridizes to a 
EKNGl nucleic acid molecule of the invention under stringent 
conditions, e.g., hybridization to filter-bound DNA in 6x 

30 

sodaum chloride/sodium citrate (SSC) at about 45''C followed 
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by one or more washes in 0.2xSSC/0.1% SDS at about 50-65*C, 
or (b) under highly stringent conditions, e.g., hybridization 
to filter-bound nucleic acid in 6xSSC at about 45''C followed 
by one or more washes in O.lxSSC/0.2% SDS at about 68 °C, or 
^ under other hybridization conditions which are apparent to 
those of skill in the art (see, for example, Ausubel F.M. et 
al., eds., 1989, Current Protocols in Molecular Biology, Vol. 
I, Green Publishing Associates, Inc., and John Wiley & sons. 
Inc., New York, at pp. 6.3.1-6.3.6 and 2.10.3). Preferably 
the EKHGl nucleic acid molecule that hybridizes to the 
nucleotide sequence of (a) and (b) , above, is one that 
comprises the complement of a nucleic acid molecule that 
encodes a HKNGl gene product. In a preferred embodiment, 
nucleic acid molecules comprising the nucleotide sequences of 
(a) and (b) , above, encode gene products, e.g., gene products 
functionally ec[uivalent to an HENGl gene product. 

Functionally equivalent HKNGl gene products include 
naturally occurring HKNGl gene products present in the same 
or different species. In one embodiment, BKNGl gene 
sequences in non-human species map to chromosome regions 
syntenic to the human 18p chromosome location within which 
20 human HKNGl lies. Functionally equivalent HKNGl gene 

products also include gene products that retain at least one 
of the biological activities of the J5KWG1 gene products, 
and/or which are recognized by and bind to antibodies 
(polyclonal or monoclonal) directed against the SKNGl gene 
products. 

25 

Among the nucleic acid molecules of the invention are 
deoxyoligonucleotides ("oligos") which hybridize under highly 
stringent or stringent conditions to the HKNGl nucleic acid 
molecules described above. In general, for probes between 14 
and 70 nucleotides in length the melting temperature (TM) is 
calculated using the formula: 

Tm(''C)=81.5+16. 6 (log [monovalent cations (moleir) ])+o.4l (% 
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G+C)-(500/N) where N is the length of the probe. If the 
hybridization is carried out in a solution containing 
formamide, the melting temperature is calculated using the 
equation Tm(''C)=81.5+l6.6(log[monovalent cations 
(molar) ])+o.41(% G+C)-(o.61% formamide)-(500/N) where M is 
S the length of the probe, in general, hybridization is 
carried out at about 20-25 degrees below Tm (for DNA-dna 
hybrids) or 10-15 degrees below Tm (for RNA-DNA hybrids). 

Exemplary highly stringent conditions may refer, e.g. , 
to washing in 6xSSC/0.05% sodium pyrophosphate at 37»c (for 
about 14-base oligos) , 48''C (for about 17-base oligos) , SS'C 
0 (for about 20-base oligos) , and 60-c (for about 23-base 
oligos). 

These nucleic acid molecules may encode or act as 
antisense molecules, useful, for example, in BKNGl gene 
regulation, and/or as antisense primers in amplification 
reactions of HKNGl gene nucleic acid sequences. Further, 
such sequences may be used as part of ribozyme and/or triple 
helix sequences, also useful for HKNGl gene regulation. 
Still further, such molecules may be used as components of 
diagnostic methods whereby, for example, the presence of a 
particular HKNGl allele involved in a iSJOTGl -related disorder, 

20 s*£U, a neuropsychiatric disorder, such as BAD, may be 
detected. 

Fragments of the HKNGl nucleic acid molecules can be at 
least 10 nucleotides in length, in alternative embodiments, 
the fragments can be about 20, 30, 40, 50, 60, 70, 80, 90, 
100, 200, 300, 400, 500, 1000, 1500, 2000, 2500, 3000, 3500, 
4000, 4500, 5000, or more contiguous nucleotides in length.' 
Alternatively, the fragments can comprise sequences that 
encode at least lo, 20, 30, 40, 50, lOO, 150, 200, 250, 300, 
350, 400, 450 or more contiguous amino acid residues of the 
HKNGl gene products. Fragments of the HKNGl nucleic acid 
molecules can also refer to HKNGl exons or introns, and, 
further, can refer to portions of HKNGl coding regions that 
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encode domains (e.q, , cluster in domains) of HKNGl gene 
products. 

The HKNGl nucleotide sequences of the invention Ccui be 
readily obtained, for example, by standard sequencing and the 
^ sequence provided herein. 

As will be appreciated by those skilled in the art, DNA 
sequence polymorphisms of a HKNGl gene will exist within a 
population of individual organisms (e.g., within a human 
population) . Such polymorphisms may exist, for example, 
among individuals within a population due to natural allelic 

10 variation. Such polymorphisms include ones that lead to 
changes in amino acid sequence. An allele is one of a group 
of genes which occur alternatively at a given genetic locus. 

As used herein, the phrase "allelic variant" refers to a 
nucleotide sequence which occurs at a given locus or to a 
gene product encoded by that nucleotide sequence. Such 

15 natural allelic variations can typically result in 1-5% 
variance in the nucleotide sequence of a given gene. 
Alternative alleles can be identified by sequencing the gene 
of interest in a number of different individuals. This can 
be readily carried out by using hybridization probes to 
identify the same genetic locus in a variety of individuals. 

20 As used herein, the terms "gene" and "recombinant gene" r^fer 
to nucleic acid molecules comprising an open reading frame 
encoding a polypeptide of the invention. The term cem 
further include nucleic acid molecules comprising upstream 
and/ or exon/intron sequences and sructure. 

With respect to HKNGl allelic variants, any and all such 

2® nucleotide variations and resulting amino acid polymorphisms 
or variations that are the result of natural allelic 
variation of the HKNGl gene are intended to be within the 
scope of the present invention. Such allelic variants 
include, but are not limited to, ones that do not alter the 
functional activity of the HKNGl gene product. Variants 
include, but are not limited to, variants comprising the 
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polymorphisms summarized in FIGS. 5A-B and a variant which 
encodes a full length HKNGl polypeptide comprising a 
substitution of a lysine amino acid at amino acid residue 202 
of the HKNGl polypeptide shown in FIG. lA-lB and SEQ ID NO: 2 
g or the HKNGl amino acid residue 184 of the polypeptide of SEQ 
ID NO:4. 

With respect to the cloning of additional allelic 
variants of the human HKNGl gene and homologues and orthologs 
from other species (e.g., guinea pig, cow, mouse), the 
isolated HKNGl gene sequences disclosed herein may be labeled 
and used to screen a cDNA library constructed from mRNA 
obtained from appropriate cells or tissues (e.g., brain or 
retinal tissues) derived from the organism (e.g., guinea pig, 
cow, and mouse) of interest. The hybridization conditions 
used should generally be of a lower stringency when the cDNA 
J.5 library is derived from an organism different from the type 
of organism from which the labeled sequence was derived, and 
can routinely be determined based on, e.g. . relative 
relatedness of the target and refemce organisms. 

Alternatively, the labeled fragment may be used t o 
screen a genomic library derived from the organism of 
20 interest, again, using appropriately stringent conditions. 
Appropriate stringency conditions are well Icnown to those of 
skill in the art as discussed above, and will vary 
predictably depending on the specific organisms from which 
the library and the labeled sequences are derived. For 
guidance regarding such conditions see, for example, 
25 Sambrook, et al., 1989, Molecular Cloning, A Laboratory 
Manual, Second Edition, Cold Spring Harbor Press, N.Y.; and 
Ausubel, et al., 1989-1999, Current Protocols in Molecular 
Biology, Green Publishing Associates and Wiley Inter science , 
N.Y., both of which are incorporated herein by reference in 
- their entirety. 
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Further, a HKNGl gene allelic variant may be isolated 
from, for example, human nucleic acid, by performing PCR 
using two degenerate oligonucleotide primer pools designed on 
the basis of amino acid sequences within the HKNGl gene 
5 product disclosed herein. The template for the reaction may 
be CDNA obtained by reverse transcription of mRNA prepared 
from, for example, human or non-human cell lines or tissue 
known or suspected to express a wild type or mutant HKNGl 
gene allele (such as, for example, brain cells, including 
brain cells from individuals having BAD) . m one embodiment 
10 the allelic variant is isolated from an individual who has a' 
fflWGJ-mediated disorder, such variants are described in the 
examples below. 

The PCR product may be subcloned and sequenced to ensure 
that the amplified sequences represent the sequences of a 
HKNGl gene nucleic acid sequence. The PCR fragment may then 
^5 be used to isolate a full length cDNA clone by a variety of 
methods. For example, the amplified fragment may be labeled 
and used to screen a bacteriophage cDNA library. 
Alternatively, the labeled fragment may be used to isolate 
genomic clones via the screening of a genomic library. 

PCR technology may also be utilized to isolate full 
length cDNA sequences. For example, RNA may be isolated • 
following standard procedures, from an appropriate cellular 
or tissue source (i.e., one known, or suspected, to express 
the HKNGl gene, such as, for example, brain tissue samples 
obtained through biopsy or post-mortem) . A reverse 
25 transcription reaction may be performed on the RNA using an 
oligonucleotide primer specific for the most 5« end of the 
amplified fragment for the priming of first strand synthesis 
The resulting RNA/DNA hybrid may then be "tailed" with 
guanines using a standard terminal transferase reaction the 
hybrid may be digested with RNAase H, and second strand' 
3P synthesis may then be primed with a poly-c primer. Thus 
CDNA sequences upstream of the amplified fragment may easily 
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be isolated. For a review of cloning strategies that may be 
used, see e.g., Sambrook et al., 1989, supra, or Ausubel et 
al., supra . 

A cDNA of an allelic, e.g. , mutant, variant of the HKNGl 
g gene may be isolated, for example, by using PGR, a technique 
that is well known to those of skill in the art. In this 
case, the first cDNA strand may be synthesized by hybridizing 
an oligo-dT oligonucleotide to mRNA isolated from tissue 
known or suspected to be expressed in an individual 
putatively carrying a mutant HKNGl allele, and by extending 
10 the new strand with reverse transcriptase. The second strand 
of the cDHA is then synthesized using an oligonucleotide that 
hybridizes specifically to the 5 • end of the normal gene. 
Using these two primers, the product is then amplified via 
PGR, cloned into a suitable vector, and subjected to DMA 
sequence analysis through methods well known to those of 
15 skill in the art. By comparing the DNA sequence of the 
mutant HKNGl allele to that of the normal HKNGl allele, the 
mutation (s) responsible for the loss or alteration of 
function of the mutant HKNGl gene product can be ascertained. 

Alternatively, a genomic library can be constructed 
using DNA obtained from an individual suspected of or known 
20 to carry a mutant HKNGl allele, or a cDNA library can be 
constructed using RNA from a tissue known, or suspected, to 
express a mutant HKNGl allele.. An unimpaired HKNGl gene or 
any suitable fragment thereof may then be labeled and used as 
a probe to identify the corresponding mutant HKNGl allele in 
25 such libraries. Clones containing the mutant HKNGl gene 
sequences may then be purified and subjected to sequence 
analysis according to methods well known to those of skill in 
the art. 

Additionally, an expression library can be constructed 
utilizing cDNA synthesized from, for example, RNA isolated 
30 from a tissue known, or suspected, to express a mutant HKNGl 
allele in an individual suspected of or known to carry such a 
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mutant allele, in this manner, gene products made by the 
putatively mutant tissue may be expressed and screened using 
standard antibody screening techniques in conjunction with 
antibodies raised against the normal HKNGl gene product, as 
^ described, below, in section 5.3. (For screening techniques 
see, for example, Harlow and Lane, eds., 1988, "Antibodies: A 
Laboratory Manual", Cold Spring Harbor Press, cold Spring 
Harbor.) 

in cases where a HKNGl mutation results in an expressed 
gene product with altered function (e.g., as a result of a 
10 missense or a frameshift mutation) , a polyclonal set of anti- 
HKNGl gene product antibodies are likely to cross-react with 
the mutant HKNGl gene product. Library clones detected via 
their reaction with such labeled antibodies can be purified 
and subjected to sequence analysis according to methods well 
known to those of skill in the art. 
^5 HKNGl mutations or polymorphisms can further be detected 

using PGR amplification techniques. Primers can routinely be 
designed to amplify overlapping regions of the whole HKNGl 
sequence including the promoter regulating region. In one 
embodiment, primers are designed to cover the exon-intron 
20 boundaries such that, coding regions can be scanned for 
mutations. Exemplary primers for analyzing HKNGl exons aire 
provided in Table i, of Section 5.6, below. 

The invention also includes nucleic acid molecules, 
preferably DNA molecules, that are the complements of the 
nucleotide sequences of the preceding paragraphs. 
25 In certain embodiments, the nucleic acid molecules of 

the invention are present as part of nucleic acid molecules 
comprising nucleic acid sequences that do not contain 
heterologous (e^, cloning vector or expression vector) 
sequences. In other embodiments, the nucleic acid molecules 
of the invention further comprise vector sequences, e.g. . 
30 Cloning vectors or expression vectors. 
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^•2. PROTErW PROPnCTB OF THE HKWGl GENE 

HKNGl gene products or peptide fragments thereof, can be 
prepared for a variety of uses. For example, such gene 
products, or peptide fragments thereof, can be used for the 
^ generation of antibodies, in diagnostic assays, or for the 
identification of other cellular or extracellular gene 
products involved in the regialation of HKNGl -mediated 
disorders, e^a^, neuropsychiatric disorders, such as BAD. 

The HKNGl gene products of the invention include, but 
are not limited to, human HKNGl gene products, e.g., 

^° polypeptides comprising the amino acid sequences depicted in 
FIGS. lA-lB, 2A-2B, 17, and 18 (i.e., SEQ ID N0S:2, 4, 51, 
and 66) . Olhe HKNGl gene products of the invention also 
include non-human, e.g., mammalian (such as bovine or guinea 
pig), HKNGl gene products. These include, but are not 

15 limited to, polypeptides comprising the amino acid sequences 
depicted in FIGS. 7-13 (i.e., SEQ ID NOS:39, 41, 43, 45, and 
49). 

HKNGl gene product, sometimes referred to herein as an 
-HKNGl protein" or "HKNGl polypeptide," includes those gene 
20 P^^oducts encoded by the HKNGl gene sequences depicted in 
FIGS. lA-lB, 2A-2B, 7-13, 17, and IB, as well as gene 
products encoded by other human allelic variants and non- 
human variants of HKNGl that can be identified by the methods 
herein described. Among such HKNGl gene product variants are 
gene products comprising HKNGl amino acid residues encoded by 

25 

the polymorphisms depicted in FIGS. 5A and SB. Such gene 
product variants also include a variant of the HKNGl gene 
product depicted in FIG. i (SEQ ID N0:2) wherein the amino 
acid residue Lys202 is mutated to a glutamic acid residue. 
Such HKNGl gene product variants also include a variant of 
30 the HKNGl gene product depicted in FIG. 2 (SEQ ID NO: 4) 
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5 



wherein the amino acid residue Lysi84 is mutated to a 
glutamic acid residue. 

In addition, HKNGl gene products may include proteins 
that represent functionally equivalent gene products. 
Functionally equivalent gene products may include, for 
example, gene products encoded by one of the HKNGl nucleic 
acid molecules described in Section 5.1, above. In preferred 
embodiments, such functionally equivalent HKNGl gene products 
are naturally occuring gene products. Functionally 
equivalent HKNGl gene products also include gene products 
10 that retain at least one of the biological activities of the 
HKNGl gene products described above, and/or which are 
recognized by and bind to antibodies (polyclonal or 
monoclonal) directed against HKNGl gene products. 

Equivalent HKNGl gene product may contain deletions, 
^5 including internal deletions, additions, including additions 
yielding fusion proteins, or substitutions of amino acid 
residues within and/or adjacent to the amino acid sequence 
encoded by the HKNGl gene sequences described, above, in 
Section 5.1. Generally, deletions will be deletions of 
single amino acid residues, or deletions of no more than 
20 about 2, 3, 4, 5, 10 or 20 amino acid residues, either 
contiguous or non-contiguous. Generally, additions or 
substitutions, other than additions that yield fusion 
proteins, will be. additions or substitutions of single amino 
acid residues, or additions or substitutions of no more than 
about 2, 3, 4, 5, 10 or 20 amino acid residues, either 
25 contiguous or non-contiguous. Preferably, these 

modifications result in a "silent" change, in that the change 
produces a HKNGl gene product with the same activity as the 
HKNGl gene product depicted in FIG. lA-lB, 2A-2B, 7-13, or 
17. 

- Amino acid substitutions may be made on the basis of 

similarity in polarity, charge, solubility, hydrophobicity 
hydrophilicity, and/or the amphipathic nature of the residues 
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involved. For example, nonpolar (hydrophobic) amino acids 
include alanine, leucine, isoleucine, valine, proline, 
phenylalanine, tryptophan, and methionine; polar neutral 
amino acids include glycine, serine, threonine, cysteine, 
tyrosine, asparagine, and glutamine; positively charged ' 
(basic) amino acids include arginine, lysine, and histidine; 
and negatively charged (acidic) amino acids include aspartic 
acid and glutamic acid. 

Alternatively, where alteration of function is desired 
addition (s), deletion (s) or non-conservative alterations can 
produce altered, including reduced-activity, HKNGl gene 
products, such alterations can, for example, alter one or 
more of the biological functions of the HKNGl gene product. 
Further, such alterations can be selected so as to generate 
HKNGl gene products that are better suited for expression, 
scale up, etc. in the host cells chosen. For example, 
15 cysteine residues can be deleted or substituted with another 
amino acid residue in order to eliminate disulfide bridges. 

As another example, altered HKNGl gene products can be 
engineered that correspond to variants of the HKNGl gene 
product associated with HJOTGI -mediated neuropsychiatric 
disorders such as BAD. Such altered HKNGl gene products 
include, but are not limited to, HKNGl proteins or peptides 
comprising substitution of a lysine residue for the wild-type 

^^"^^^^o ^^oid residue at HKNGl amino acid position 202 in 

FIG. lA-lB (SEQ ID NO: 2) or amino acid position 184 (SEQ ID 
NO: 4) in FIG. 2A-2B. 

23 HKNGl protein fragments and/or HKNGl peptides comprise 

at least as many contiguous amino acid residues as necessary 
to represent an epitope fragment (that is to be recognized by 
an antibody directed to the HKNGl protein) . For example, 
such protein fragments or peptides comprise at least about 8 
contiguous HKNGl amino acid residues from a full length HKNGl 

30 protexn. in alternate embodiments, the HKNGl protein 

fragments and peptides of the Invention can comprise about 
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10 20, 30, 40, 50, 60, 70, ao, 90, 100, 150, 200, 250, 300, 
350, 400, 450 or more contiguous amino acid residues of a 
HKMGl protein. 

Peptides and/or proteins corresponding to one or more 
^ domains of the HKNGl protein as well as fusion proteins in 
Which a HmGl protein, or a portion of a HKNGl protein such 
as a truncated HKNGl protein or peptide or a HKNGl protein 
domain, is fused to an unrelated protein are also within the 
scope of this invention. Such proteins and peptides can be 
designed on the basis of the HKNGl nucleotide sequence 
10 disclosed in Section 5.1, above, and/or on the basis of the 
HKNGl amino acid sequence disclosed in the Section. Fusion 
proteins include, but are not limited to, igFc fusions which 
stabilize the HKNGl protein or peptide and prolong half life 
in vivo; or fusions to any amino acid sequence that allows 
^3 the fusion protein to be anchored to the cell membrane; or 
fusions to an enzyme, fluorescent protein, luminescent 
protein, or a flag epitope protein or peptide which provides 
a marker function. 

The HKNGl protein, the HKNGl protein sequences described 
above can include a domain which comprises a signal sequence 

20 that targets the HKNGl gene product for secretion. As used 
herein, a signal sequence includes a peptide of at least " 
about 15 or 20 amino acid residues in length which occurs at 
the N-terminus of secretory and membrane-bound proteins and 
Which contains at least about 70% hydrophobic amino acid 
residues such as alanine, leucine, isoleucine, phenylalanine, 

25 proline, tyrosine, tryptophan, or valine, m a preferred 
eabodiaent, a signal sequence contains at least about 10 to 
40 amino acid residues, preferably about 19-34 amino acid 
residues, and has at least about 60-80%, more preferably 
65-75%, and more preferably at least about 70% hydrophobic 

_ residues, a signal sequence serves to direct a protein 
30 containing such a sequence to a lipid bilaycr. 
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In one embodiment, a HNKNGl protein contains a signal 
sequence at about amino acids 1 to 49 of SEQ ID NO: 2. In 
another embodiment, a HKNGl protein contains a signal 
sequence at about amino acids 30-49 of SEQ ID NO: 2. In yet 
another embodiment, a HKNGl protein contains a signal 
sequence at about amino acid residues 1 to 31 of SEQ ID NO: 4. 
In yet another embodiment, a HENGl protein contains a signal 
sequence at about amino acids 12-31 of SEQ ID NO: 4. The 
signal sequence is cleaved during processing of the mature 
protein. 

A signal sequence of a polypeptide of the invention can 
be used to facilitate secretion and isolation of the secreted 
protein or other proteins of interest. Signal sequences cire 
typically characterized by a core of hydrophobic amino acids 
which are generally cleaved from the mature protein during 
secretion in one or more cleavage events. Such signal 
peptides contain processing sites that allow cleavage of the 
signal sequence from the mature proteins as they pass through 
the secretory pathway. Thus, the invention pertains to the 
described HKNGl polypeptides having a signal sequence (that 
is, "immature" polypeptides), as well as to the HKNGl signal 
sequences themselves and to the HNKGl polypeptides in the 

2® absence of a signal sequence (i.e., the "mature" HKNGl 
cleavage products) . It is to be understood that HKNGl 
polypeptides of the invention can further comprise 
polypeptides comprising any signal sequence having 
characteristics as described above emd a mature HKNGl 

25 polypeptide sequence. 

In one embodiment, a nucleic acid sequence encoding a 
signal sequence of the invention can be operably linked in an 
expression vector to a protein of interest, such as a protein 
which is ordinarily not secreted or is otherwise difficult to 
isolate. The signal sequence directs secretion of the 

3Q protein, such as from a eukaryotic host into which the 

expression vector is transformed, and the signal sequence is 
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subsequently or concurrently cleaved. The protein can then 
be readily purified from the extracellular medium by art 
recognized methods. Alternatively, the signal sequence can 
be linked to the protein of interest using a sequence which 
facilitates purification, such as with a GST domain. 

The HKNGl protein sequences described above can also 
include one or more domains which comprise a clusterin 
domain, i.e., domains which are identical to or substantially 
homologous to (i^, 65%, 75%, 80%, 85%, 90%, 95% or more 
identical to) the domain corresponding to amino acid residues 

°^ ^ino acid residues 334 to 362 of SEQ ID N0:2, 
or to the domain corresponding to amino acid residues 105-131 
or amino acid residues 305-333 of SEQ ID No: 39, or to the 
domain corresponding to amino acid residues 105-131 or amino 
acid residues 304-332 of SEQ ID NO:49. Preferably, such 
domains comprise cysteine amino acid residues at positions 
corresponding to conserved cysteine residues of the clusterin 
domains of SEQ ID NOS: 2, 39 or 49. 

In particular, HKNGl protein sequences described above 
can also include one or more domains which comprise a 
conserved cysteine domain. Such a domain corresponds, for 
example, to the domain of cysteines corresponding to Cysl34, 
Cysl45, Cysl48, Cysl58 and Cysl60; or to Cys 334, Cys344, 
Cys351, Cys354, and Cys362 of SEQ ID N0:2. In an alternative 
embodiment, a conserved cystein domain corresponds to one or 
more of the domains of SEQ ID NO: 39. which, comprises Cysios, - 
Cyslie, Cysll9, Cysl24, and Cysl31; or Cys305, Cys315, 
Cys322, Cys325, and Cys333. In yet another alternative 
embodiment, a conserved cysteine domain corresponds to one or 
more of the domains of SEQ ID NO: 49 which comprises Cysl05, 
Cyslie, Cysll9, Cysl24, and Cysl3l; or Cys314, Cys321, 
Cys324, and Cys332. 

Finally, the HKNGl proteins of the invention also 
include HKNGl protein sequences wherein domains encoded by 
one or more exons of the cDNA sequence, or fragments ther of, 
have been deleted. In one particularly preferred embodiment. 
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the HKNGi proteins of the invention are proteins in which the 
doBaxn(s) corresponding the those domains encoded by exon 7 
Of SEQ ID NO: 7, or fragments thereof, have been deleted 
The HKNGI polypeptides of the invention can further 
^ comprise posttranslational modifications, including, but not 
ixmxted to glycosylations, acetylations, and myrisalations 

The HKNGl gene products, peptide fragments thereof and 
fusion proteins thereof, may be produced by recombinant DNA 
technology using techniques well known in the art. Thus 
methods for preparing the HKNGl gene products, polypeptides 
XO peptides, fusion peptide and fusion polypeptides of the 
invention by expressing nucleic acid containing HKNGl gene 
sequences are described herein. Methods that are well known 
to those Skilled in the art can be used to construct 
expression vectors containing HKNGl gene product coding 
sequences and appropriate transcriptional and translational 
control signals. These methods include, for example, in 
vitro recombinant DNA techniques, synthetic techniques, and 
in vivo genetic recombination. See, for example, the 
techniques described in Sambrook, et ai., 1989, supra, and 
^ Ausubel, et al., i989, supra. Alternatively, rna capable of 
encoding HKNGl gene product sequences may be chemically 
synthesized using, for example, synthesizers. See, for 
example, the techniques described in "Oligonucleotide 
synthesis", 1984, Gait, ed., IRL Press, Oxford. 

A variety of host-expression vector systems may be 
utilxzed to express the HKNGl gene product coding sequences 
Of the invention. Such host-expression systems represent 
vehicles by which the coding sequences of interest may be 
produced and subsequently purified, but also represent cells 
that may, when transformed or transfected with the 
appropriate nucleotide coding sequences, exhibit the HKNGl 
30 gene product of the invention In situ. These include but are 
not limited to microorganisms such as bacteria (e.g., s. 
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B transformed with r«»^i„,„t >»cteriopta,e 

fflWSi gene product coding sequences; yeast (e.g. 
^cc*aro.yces, PicWe, transformed vlth recombW yeast 
s expression vectors containing the ^« gene product coding 

viT':::re:sT^ ^^^^^ -^^^ reco^inant 

Z: containing tne 

BK^ai gene product coding sequences; plant cell systems 
anfected with recombinant virus expression vectors (e g 
cauliflower mosaic virus r^^wv. 4- w 

4. ^ Virus, CaMV; tobacco mosaic virus tmvi «^ 

Tx piasmid) containing fflofci gene product coding 
sequences; or mammalian cell systems (e.g., cos, CHo, bhk, 

IS ^ll , "•''^'^^ "^ived from the genome of mammalian 
IS cells (e.g., metallothionein promoter, or from mammallT 

viruses (e.g., the adenovirus late promoter; the vaccinia 

vxrus 7 . 5K promoter) . 

.e ad^t1:r ly s^re; d'eZT " "^""^ - 

„ .or the gen^e prod^ ZZZr^J^'-'^T 

When a large quantity of such a protein is to be produced 
for the generation of Pharmaceutical compositions of 
gene product or for raising antibodies to ge„e product 

for example, vectors that direct the expression of high 
levels Of fusion protein products that are readllv nu!L, . 
25 mav be d^^zir-^M^ ^ reaaiiy purxfled 

to t^ I T «e not limited 

to the ^. coix expression vector p«R2,8 ,R„ther et ai 1^3 

^ J- 2 = 17«, , in Which the BKNai gene product codi.!^ ' 
sequence may be ligated individually Into the vector in frame 
vxth the coding region so that a fusion protein Is 

/ van neexe and Schuster, 1989, j. Biol. 
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Chem. 264:5503-5509); and the like. pGEX vectors may also be 
used to express foreign polypeptides as fusion proteins with 
glutathione S-transf erase (GST). In general, such fusion 
proteins are soluble and can easily be purified from lysed 
cells by adsorption to glutathione-agarose beads followed by 
S elution in the presence of free glutathione. The pGEX 
vectors are designed to include thrombin or factor Xa 
protease cleavage sites so that the cloned target gene 
product can be released from the GST moiety. 

In an insect system, Autographa californica, nuclear 
polyhidrosis virus (AcNPV) is used as a vector to express 
foreign genes. The virus grows in Spodoptera fruglperda 
cells. The HKNGl gene product coding sequence may be cloned 
individually into non-essential regions (for example the 
polyhedrin gene) of the virus and placed under control of an 
ACNPV promoter (for example the polyhedrin promoter) . 
15 Successful insertion of EKNGl gene product coding sequence 
will result in inactivation of the polyhedrin gene and 
production of non-occluded recombinant virus (i.e., virus 
lacking the proteinaceous coat coded for by the polyhedrin 
gene) . These recombinant viruses are then used to infect 
Spodoptera frugiperda cells in which the inserted gene is 
expressed. (e.g., see Smith, et al., 1983, J. virol. 46:584; 
Smith, U.S. Patent No. 4,215,051). 

. host cens, a number of viral-based 

expression systems may be utilized, in cases where an 
adenovirus is used as an expression vector, the EKNGl gene 
25 product coding sequence of interest may be ligated to an 
adenovirus transcription/translation control complex, e.g., 
the late promoter and tripartite leader sequence. This 
chimeric gene may then be inserted in the adenovirus genome 
by in vitro or in vivo recombination. Insertion in a non- 
essential region of the viral genome (e.g., region El or E3) 
will result in a recombinant virus that is viable and capable 
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Of repressing hk»gi gene product In Infected hosts. (e g 
see Logan and shank, l<,84, Proc. Hatl. Acad. scl. OSA " 
»l:3655-3659). Specific Initiation signals may also be 
required for efficient translation of inserted mci gene 
, product coding sequences. These signals include the ATG 
initiation codon and adjacent sequences, in cases „he« an 
entire .K«ai gene, including its o» Initiation codon ^d 
adjacent sequences, is inserted into the appropriate 
expression vector, no additional translatlonal control 
signals may be needed. However, in cases where only a 
„ portion of the HI^ci gene coding sequence is inserted, 

"r^"nm«''"°"^' signals, including, perhaps, 

the ATG initiation codon, must be provided. Furthermore, tl^e 
initiation codon must be in phase with the reading frame of 
the desired coding sequence to ensure translation of the 

X5 »d Lt'T- translatlonal control signals 

15 and initiation codons can be of a variety of origins boL 

enhanced by the inclusion of appropriate transcription 
enhancer elements, transcription terminators, etc. (see 
Bittner, et al., war. Methods In Enzyaol. 163:516-544) 

" moduJt " """'^ "'''^ 

modulates the e:tpressio„ of the Inserted sequences, or 

modifies and processes the gene product in the specif Ic 
fashion desired, such modifications (e.g., glycosylation, 
and processing (e.g. , cleavage) of protein products may be 

c*^rr* °' ""^'^t host 

„ cells have characteristic and specific mechanisms for the 
post-translatlonal processing and modification of proteins 
and gene products. Appropriate cell lines or host sysZ 
can be chosen to ensure the correct modification and 
processing of the foreign protein e^ressed. To this end 

- ""^ "-^^ ^™ ,achin:;'for 

" »d^ '''^-'"^ °' tte primary transcript, glycosylation, 
and Phosphorylation of the gene product may be used! Such 
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meonmalian host cells include but are not limited to CHO, 
VERO, BHK, HeLa, COS, MDCK, 293, 3T3 , and WI38, 

For long-term, high-yi Id production of recombinant 
proteins, stable expression is preferred. For example, cell 
lines that stcODly express the HKNGl gene product may be 

5 

engineered. Rather than using expression vectors that 
contain viral origins of replication, host cells can be 
transformed with DNA controlled by appropriate expression 
control elements (e.g., promoter, enhancer, sequences, 
transcription terminators, polyadenylation sites, etc.)/ and 
a selectable marker. Following the introduction of the 
foreign DNA, engineered cells may be allowed to grow for 1-2 
days in an enriched media, and then are switched to a 
selective media- The selectable marker in the recombinant 
plasmid confers resistance to the selection and allows cells 
to stably integrate the plasmid into their chromosomes aind 
grow to form foci that in turn can be cloned and expanded 
into cell lines. This method may advantageously be used to 
engineer cell lines that express the HKNGl gene product. 
Such engineered cell lines may be particularly useful in 
screening and evaluation of compounds that affect the 
endogenous activity of the HKNGl gene product. 
20 A number of selection systems may be used, including -but 

not limited to the herpes simplex virus thymidine kinase 
(Wigler, et al., 1977, Cell 11:223), hypoxanthine-guanine 
phosphoribosyltransf erase (Szybalska and Szybalski, 1962, 
Proc. Natl. Acad. Sci. USA 48:2026), and adenine 
phosphoribosyltransf erase (Lowy, et al., 1980, Cell 22:817) 

25 

genes can be employed in tk", hgprt" or aprf cells, 
respectively. Also, antimetabolite resistance can be used as 
the basis of selection for the following genes: dhfr, which 
confers resistance to methotrexate (Wigler, et al., 1980, 

^ Natl. Acad. Sci. USA 77:3567; O'Hare, et al., 1981, Proc. 

30 Natl. Acad. Sci. USA 78:1527); gpt, which confers resistance 
to mycophenolic acid (Mulligan and Berg, 1981, Proc. Natl. 
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Acad. Sci. USA 78:2072); neo, which confers resistance to the 
aminoglycoside G-418 (Colberre-Garapin, et al., 1981, J. Mol. 
Biol. 150:1); and hygro, which confers resistance to 
hygromycin (Santerre, et al., 1984, Gene 30:147). 
g Alternatively, the expression characteristics of an 

endogenous HKNGl gene within a cell line or microorganism may 
be modified by inserting a heterologous DNA regulatory 
element into the genome of a stable cell line or cloned 
microorganism such that the inserted regulatory element is 
operatively linked with the endogenous HKNGl gene. For 

10 exan^le, an endogenous HKNGl gene which is normally 

"transcriptionally silent", i.e., an HKNGl gene which is 
normally not expressed, or is expressed only at very low 
levels in a cell line or microorganism, may be activated by 
inserting a regulatory element which is capable of promoting 

^ the expression of a normally expressed gene product in that 
cell line or microorgeinism. Alternatively, a 
transcriptionally silent, endogenous HKNGl gene may be 

activated by insertion of a promiscuous regulatory element 
that works across cell types. 

A heterologous regulatory element may be inserted into a 
stable cell line or cloned microorganism, such that it is 
operatively linked with an endogenous HKNGl gene, using 
techniques, such as targeted homologous recombination, which 
are well known to those of skill in the art, and described 
e.g., in Chappel, U.S. Patent No. 5,272,071; PCT publication 
No. WO 91/06667, published May 16, 1991. 
25 Alternatively, any fusion protein may be readily 

purified by utilizing an antibody specific for the fusion 
protein being expressed. For example, a system described by 
Janknecht, et al. allows for the ready purification of non- 
denatured fusion proteins expressed in human cell lines 
(Janknecht, et al., 1991, Proc. Natl. Acad. Sci. USA 88:8972- 

30 

8976). In this system, the gene of interest is subcloned 
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into a vaccinia recombination plasmid such that the gene's 
open reading frame is translationally fused to an amino- 
terminal tag consisting of six histidine residues. Extracts 
from cells infected with recombinant vaccinia virus are 
loaded onto Ni^^-nitriloacetic acid-agarose columns and 
5 histidine-tagged proteins are selectively eluted with 
imidazole-containing buffers. 

The BKNGl gene products can also be expressed in 
transgenic animals. Animals of any species, including, but 
not limited to, mice, rats, rabbits, guinea pigs, pigs, 
micro-pigs, goats, sheep, cows, and non-human primates, e.g., 
baboons, monkeys, and chimpanzees may be used to generate 
HKNGl transgenic animals. The term "transgenic," as used 
herein, refers to animals expressing HKNGl gene sequences 
from a different species (e.g., mice expressing huma mOTGl 
gene sequences), as well as animals that have been 
15 genetically engineered to overexpress endogenous (i.e., same 
species) HKNGl sequences or animals that have been 
genetically engineered to no longer express endogenous HKNGl 
gene sequences (i.e., "knock-out" animals), and their 
progeny. 

20 Any technique known in the art may be used to introduce 

a HKNGl gene transgene into animals to produce the founder 
lines of transgenic animals. Such techniques include, but 
are not limited to pronuclear microinjection (Hoppe and 
Wagner, 1989, U.S. Pat. No. 4,873,191); retrovirus mediated 
gene transfer into germ lines (Van der Putten, Bt al., 1935, 
Proc. Natl. Acad. Sci., USA 82:6148-6152); gene targeting in 
embryonic stem cells (Thompson, et al., 1989, Cell 56:313- 
321); electroporation of embryos (Lo, 1983, Mol. Cell. Biol. 
3:1803-1814); and sperm-mediated gene transfer (Lavitrano et 

- al., 1989, Cell 57:717-723) (For a review of such techniques, 

30 see Gordon, 1989, Transgenic Animals, Intl. Rev. Cytol. 115, 
171-229) 
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Any technique known in the art may be used to produce 
transgenic animal clones containing a HKNGl transgene^ for 
example, nuclear transfer into enucleated oocytes of nuclei 
from cultured embryonic, fetal or adult cells induced to 
^ quiescence (Campbell, et al., 1996, Nature 380:64-66; Wilmut, 
et al., Natxire 385:810-813). 

The present invention provides for transgenic animals 
that carry a HKNGl transgene in all their cells, as well as 
animals that carry the transgene in some, but not all their 
cells, i.e., mosaic animals. The transgene may be integrated 
as a single transgene or in concatamers, e.g., head-to-head 
tandems or head-to-tail tandems. The transgene may also be 
selectively introduced into and activated in a particular 
cell type by following, for example, the teaching of Lasko et 
al. (Lasko, et al., 1992, Proc. Natl. Acad. Sci. USA 89:6232- 
^5 6236). The regulatory sequences required for such a cell- 
type specific activation will depend upon the particular cell 
type of interest, and will be apparent to those of skill in 
the art. When it is desired that the HKNGl transgene be 
integrated into the chromosomal site of the endogenous HKNGl 
gene, gene targeting is preferred. Briefly, when such a 
technique is to be utilized, vectors containing some 
nucleotide sequences homologous to the endogenous HKNGl gene 
are designed for the purpose of integrating, via homologous 
recombination with chromosomal sec[uences, into an€L disrupting 
the function of the nucleotide sequence of the endogenous 
HKNGl gene. The transgene may also be selectively introduced 

2 9 

into a particular cell type, thus inactivating the endogenous 
HKNGl gene in only that cell type, by following, for example, 
the teaching of Gu, et al. (Gu, et al., 1994, Science 265, 
103-106) . The regulatory sequences required for such a cell- 
- type specific inactivation will depend upon the particular 
30 cell type of interest, and will be apparent to those of skill 
in the art. 
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Once transgenic animals have been generated, the 
expression of the recombinant HKNGl gene may be assayed 
utilizing standard techniques* Initial screening may be 
accomplished by Southern blot analysis or PGR techniques to 
analyze animal tissues to assay whether integration of the 
transgene has taken place. The level of mRNA expression of 
the transgene in the tissues of the transgenic animals may 
also be assessed using techniques that include but are not 
limited to Northern blot analysis of tissue samples obtained 
from the animal, in situ hybridization analysis, and RT-PCR 
(reverse transcriptase PGR) . Samples of HKNGl gene- 
expressing tissue, may also be evaluated immunocytochemically 
using antibodies specific for the HKNGl transgene product. 

HKNGl proteins can be used, e.g. , to treat CNS-related 
disorders, e.g. , neuropsychiatric disorders. Such HKNGl gene 
products include but are not limited to soluble derivatives 
such as peptides or polypeptides corresponding to one or more 
domains of the HKNGl gene product, particularly HKNGl gene 
products, that core modified such that they are deleted for 
one or more hydrophobic domains. Alternatively, antibodies 
to the HKNGl protein or anti-idiotypic antibodies that mimic 

20 the HKNGl gene product (including Fab fragments) , antagonists 
or agonists can be used to treat neuropsychiatric disorders 
involving HKNGl. In yet another approach, nucleotide 
constructs encoding such HKNGl gene products can be used to 
genetically engineer host cells to express such HKNGl gene 

25 P^od^^ts in vivo: these genetically engineered cells can 

function as "bioreactors" in the body delivering a continuous 
supply of HKNGl gene product, HKNGl peptides, soliible HKNGl 
polypeptides . 

5,3. ANTIBODIES TO HKNGl GENE PRODUCTS 
30 Described herein are methods for the production of 

antibodies capable of specifically recognizing one or more 
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SKNGl gene product epitopes or epitopes of conserved variants 
or peptide fragments of the HKNGi gene products. Further, 
antibodies that specifically recognize mutant forms of snfGl, 
are encompassed by the invention. The terms "specifically 
S bind" and "specifically recognize" refer to antibodies that 
bind to EKNGl gene product epitopes at a higher affinity than 
they bind to non-HKNGl (e^, random) epitopes. 

such antibodies may include, but are not limited to 
polyclonal antibodies, monoclonal antibodies (mAbs) 
humanized or chimeric antibodies, single chain antibodies. 
Fab fragments, P(ab«), fragments, fragments produced by a Fab 
expression library, anti-idiotypic (anti-id) antibodies, and 
epitope-binding fragments of any of the above, including the 
polyclonal and monoclonal antibodies described in Section 12 
below, such antibodies may be used, for example, in the 
detection of a HKNGl gene product in an biological sample and 
may, therefore, be utilized as part of a diagnostic or 
prognostic technique whereby patients may be tested for 
abnormal levels of HKNGl gene products, and/or for the 
presence of abnormal forms of such gene products, such 
antibodies may also be utilized in conjunction with, for 
2p example, compound screening schemes, as described, below, in 
Section 5.8, for the evaluation of the effect of test 
compounds on SKNGl gene product levels and/or activity. 
Additionally, such antibodies can be used in conjunction with 
the gene therapy techniques described, below, in Section 
5.9.2 to, for example, evaluate the normal and/or engineered 
25 inorGI-expressing cells prior to their introduction into the 
patient. 

Anti-EKNGl gene product antibodies may additionally be 
used in methods for inhibiting abnormal HKNGl gene product 
_ activity. Thus, such antibodies may, therefore, be utilized 
as part of treatment methods for a fflOTGl-mediated 
neuropsychiatric disorder, such as BAD or schizophrenia. 



- 44 - 



wo 99/47535 



PCT/US99/05606 



For the production of antibodies against a HENGl gene 
product, various host animals may be immunized by injection 
with a HKNGl gene product, or a portion thereof. Such host 
animals may include, but are not limited to rabbits, mice, 
g and rats, to name but a few. Various adjuvants may be used 
to increase the immunological response, depending on the host 
species, including but not limited to Freund's (complete and 
incomplete) , mineral gels such as aliuninum hydroxide , surface 
active substances such as lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, keyhole limpet 
hemocyanin, dinitrophenol , and potentially useful human 
adjuvants such as BCG (bacille Calmette-Guerln) and 
Coiyneiiacterium parvum. 

Polyclonal antibodies eire heterogeneous populations of 
antibody molecules derived from the sera of animals immvmized 
with an antigen, such as a HKNGl gene product, or an 
15 antigenic functional derivative thereof. For the production 
of polyclonal antibodies, host animals such as those 
described above, may be immunized by injection with HKNGl 

gene product supplemented with adjuvants as also described 
above . 

Monoclonal antibodies, which are homogeneous populations 
20 of antibodies to a particular antigen, may be obtained by any 
technique that provides for the production of antibody 
molecules by continuous cell lines in culture. These 
include, but are not limited to, the hybridoma technique of 
Kohler and Milstein, (1575, Nature 256:495-497; and U.S. 
Patent No. 4,376,110), the human B-cell hybridoma technique 
25 (Kosbor et al., 1983, Immunology Today 4:72; Cole et aJ., 
1983, Proc. Natl. Acad. Sci. USA 80:2026-2030), and the EBV- 
hybridoma technique (Cole et al., 1985, Monoclonal Antibodies 
And Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Such 
antibodies may be of any immunoglobulin class including igG, 
3Q IgM, IgE, IgA, IgD and any subclass thereof. The hybridoma 
producing the mAb of this invention may be cultivated in 
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Vitro or in vivo. Production of high titers of mAbs in vivo 
makes this the presently preferred method of production. 

in addition, techniques developed for the production of 
"chimeric antibodies" (Morrison, et al., 1984, Proc. Natl. 
5 Acad. Sci., 81:6851-6855; Neuberger, et al,, 1984, Nature 
312:604-608; Takeda, et al., 1985, Nature, 314:452-454) by 
splicing the genes from a mouse antibody molecule of 
appropriate antigen specificity together with genes from a 
human antibody molecule of appropriate biological activity 
can be used. A chimeric antibody is a molecule in which 
10 different portions are derived from different animal species 
such as those having a variable region derived from a murine' 
mAb and a human immunoglobulin constant region. (See, e.g., 
cabilly et al., U.S. Patent No. 4,816,567; and Boss et al." 
U.S. Patent No. 4,816397, which are incorporated herein by 
reference in their entirety.) 
^5 In addition, techniques have been developed for the 

production of humanized antibodies. (See, e.g.. Queen, U.S. 
Patent Ho. 5,585,089, which is incorporated herein by 
reference in its entirety.) An immunoglobulin light or heavy 
chain variable region consists of a "framework" region 
interrupted by three hypervariable regions, referred to as 
complementarily determining regions (CDRs) . The extent of 
the framework region and CDRs have been precisely defined 
(see, "Sequences of Proteins of Immunological Interest", 
Rabat, E. et al., u.S. Department of Health and Human 
services (1983) ). Briefly, humanized antibodies are 
antibody molecules from non-human species having one or more 
CDRs from the non-human species and a framework region from a 
human immunoglobulin molecule; 

Alternatively, techniques described for the production 
of single chain antibodies (U.S. Patent 4,946,778; Bird, 
1988, Science 242:423-426; Huston, et al., 1988, Proc. Natl. 
30 Acad. sci. USA 85:5879-5883; and Ward, et al., 1989, Nature 
334:544-546) can be adapted to produce single chain 
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antibodies against HKNGl gene products. Single chain 
antibodies are formed by linking the heavy and light chain 
fragments of the Pv region via an amino acid bridge, 
resulting in a single chain polypeptide. 
^ Antibody fragments that recognize specific epitopes may 

be generated by known techniques. For example, such 
fragments include but are not limited to: the F(SLb'), 
fragments, which can be produced by pepsin digestion of the 
antibody molecule and the Fab fragments, which can be 
generated by reducing the disulfide bridges of the F(ab«)j 
fragments. Alternatively, Fab expression libraries may be 
constructed (Huse, et al., 1989, Science 246:1275-1281) to 
allow rapid and easy identification of monoclonal Fab 
fragments with the desired specificity. 

5.4. USES OP HENGl 6EKE SEQUEHCEB 
SENB PRQ OUerg. AMD MmTBPnTFiR 

Described herein are various applications of HKNGl gene 
sequences, HKNGl gene products, including peptide fragments 
and fusion proteins thereof, and of antibodies directed 
against HKNGl gene products and peptide fragments thereof. 
Such applications include, for example, mapping of chromosome 
18p, prognostic and diagnostic evaluation of HKWGJ -mediated 
disorders, including CNS-related disorders, e.g. . 
neuropsychiatric disorders, such as BAD or schizophrenia, 
modulation of ffiWGI-related processes, and the identification 
of subjects with a predisposition to such disorders, as 
25 described, below, in Section 5.5. 

Additionally, such applications include methods for the 
treatment of a iSCffGl -mediated disorders, such as BAD or 
schizophrenia, as described, below, in Section 5.9, and for 
the identification of compounds that modulate the expression 
. of the HKNGl gene and/or the synthesis or activity of the 
30 HKNGl gene product, as described below, in Section 5.8. Such 
compounds can include, for example, other cellular products 
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that are involved in such processes as mood regulation and in 

JJOTGl -mediated disorders e „ w m 

"isoraers, e^, neuropsychiatric disorders 

such as BAD or schi^ophrenia. These compounds can be used 
for example, in the amelioration of fiiWGl-mediated disorders 
5 and for the modulation of ffJOTGI-mediated processes, 

uses of the HKNGl gene sequences, HKNGl gene products 
including peptide fragments and fusion proteins thereof, and 
Of antxbodxes directed against HKNGl gene products and/or 
peptide fragments thereof also include prognostic and 
diagnostic evaluation of a fflO^Gi -mediated myopia disorder 
such as early-onset autosomal dominant myopia, methods for 
the treatment of a FiT^Gi-mediated myopia disorder, and for 
the identification of compound that modulate the expression 
Of the HKNGl gene and/or the synthesis or activity of the 
HKNGl gene product and could therefore be used in the 
15 amelioration of a ^in.«-mediated myopia such as early-onset 
autosomal dominant myopia, indeed, such methods are 
substantially identical to the methods described, below in 
Scions 5.5, 5.8, and 5.9 for the diagnosis and treatment of 
HKWGi-mediated disorders. 

20 

5.5. DIAGNOSIS OP HKWGI -MEDIATED 
PISORDBRfi 

A variety of methods can be employed for the diagnostic 
and prognostic evaluation of ffJCi^Gl-mediated disorders, e.g. 
neuropsychiatric disorders and for the identification of 

25 subjects having a predisposition to such disorders. 

such methods may, for example, utilize reagents such as 
the HKNGl gene nucleotide sequences described in Sections 
5.1, and antibodies directed against HKNGl gene products 
including peptide fragments thereof, as described, above' in 

- section 5.3. specifically, such reagents may be used, for 

30 example, for: 



- 48 - 



wo 99/47535 



PCT/US99/05606 



(1) the detection of the presence ot fflORji gene 
Mutations, or the detection of either over- or under- 
e^rassicn of gene relative to wild-type Hma, levels 

Of expression; 

S (2) the detection of over- or under-abundance of MKNGl 

9ene product relative to wild-type abundance of ,K«az gene 
product; and 

(3) the detection of an aberrant level of HKNGi gene 
product activity relative to wild-type HK^ai gene product 
activity levels. 

BK„S1 gene nucleotide eeguences can, for example, be 
used to diagnose a OTGZ-„ediated neuropsychiatric disorder 
using, for example, the techniques for Hmci 

sT'Z'ZTT"' ^" «-tion 

3.1, and in Section 5.6 below 

" to ph::::^Tr:L L^T"^ - -i«erent genetic loci „ay lead 
T,,.!,, !r "^'^^'^ *° neuropsychiatric disorders. 
Ideally, the treatment of patients suffering from such 
-ropsychiatrlc disorder will be designed I targe^the 

M«« PolJ-O'TW^- have been linlcM to 

al^"' " ^ effectiveness. Thus, identification of 
alterations in the mai gene, protein or gene flanking 
regions, can be utilised In pharmacogenetic .ethods to 
optiBize therapeutic drug treatnents. 

alt-ra«ons T^"^^ °' ^ invention, therefore, 

alt^atlons, i.e., polyMorphlsms, in the Hmei gene or 

protein encoded by genes comprising such polymorphisms are 
llToT" T ^ °' ^^el,ce ;r 

^t^ile't^ P»a™=oge„omlc methods to 

t--tments, including therapeutic 
<irug treatments for one of the disorders described herein 

™.l-mediated disorders such as schizophrenia a^^ B^O. 
-such polymorphisms can be used, for example, to refine the 
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design of drugs by decreasing the incidence of adverse events 
m drug tolerance studies, e.g., by identifying patient 
subpopulations of individuals who respond or do not respond 
to a particular drug therapy in efficacy studies, wherein the 
^ subpopulatxons have a HKNGl poly^norphism associated with drug 
responsiveness or unresponsiveness. The phannacogenoxaic 
methods ot the present invention can also provide tools to 
xdentafy new drug targets for designing drugs and to optimize 
the use Of already existing drugs, e.g., to increase the 
response rate to a drug and/or to identify and exclude non- 
10 responders from certain drug treatments (e.g., individuals 
having a particular HKNGl polymorphism associated with 
unresponsiveness or inferior responsiveness to the drug 
treatment) or to decrease the undersireable side effects of 
certaxn drug treatments and/or to identify and exclude 
xndxvxduals with marked susceptibility to such side effects 
(e.g., individuals having a particular HKNGl polymorphism 

associated with an undersirable side effect to the drua 
treatment) . ^ 

in an embodiment of the present invention, polymorphisms 
xn the HKNGl gene sequence or flanking this sequence, or 
2P variations in HKNGl gene expression, or activity, ^ 

variations due to altered methylat ion, differential splicing 
or post-translational modification of the HKNGl gene product' 
may be utilized to identify an individual having a disease o^ 
condition resulting from a fflWGl-mediated disorder and thus 
define the most effective and safest drug treatment. Assays 
such as those described herein may be used to identify such 
polymorphisms or variations in HKNGl gene expression or 
activity, onbe a polymorphism in the HKNGl gene or in a 
flanking sequence in linkage disequilibrium with a disorder- 
_ causing allele, or a variation in HKNGl gene expression has 
30 been identified in an individual, an appropriate drug 
treatment can be prescribed to the individual. 
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For the detection of HKNGl gene mutations or 
polymorphisms, any nucleated cell can be used as a starting 
source for genomic nucleic acid. For the detection of HmGl 
gene expression or SKNGl gene products, any cell type or 
5 tissue in which the HKNGl gene is expressed may be utilized. 
Nucleic acid-based detection techniques are described 
below, in section 5.6. Peptide detection techniques are 
described, below, in Section 5.7. 

The methods described herein may be performed, for 
example, by utilizing pre-packaged diagnostic kits. The 
xo invention therefore also encompasses kits for detecting the 
presence of a polypeptide or nucleic acid of the invention in 
a biological sample (i.e., a test sample). Such kits can be 
used, e.g., to determine if a subject is suffering from or is 
at increased risk of developing a disorder associated with a 
disorder-causing allele, or aberrant expression or activity 
of a polypeptide of the invention (e.g., a CNS disorder, 
including a neurospychiatric disorder such as BAD or 
schizophrenia). For example, the kit can comprise a labeled 
compound or agent capable of detecting the polypeptide or 
aRNA or DNA or BKNGl gene sequences, encoding the 

20 polypeptide in a biological sample. The kit can further 
comprise a means for determining the amount of the 
polypeptide or mRNA in the sample (e.g., an antibody which 
binds the polypeptide or an oligonucleotide probe which binds 
to DHA or mSNA encoding the polypeptide) . Kits can also 
include instructions for observing that the tested subject is 
25 suffering from or is at risk of developing a disorder 

associated with aberrant expression of the polypeptide if the 
amount of the polypeptide or mRMA encoding the polypeptide is 
above or below a normal level, or if the DNA correlates with 
presence of a HKNGl allele that causes a disorder. 

For antibody-based kits, the kit can comprise, for 
example: (i) a first antibody (e.g., attached to a solid 
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support) which binds to a polypeptide of the invention; and, 
optionally, (2) a second, different antibody which binds to' 
either the polypeptide or to the first antibody and is 
conjugated to a detectable agent. 

For oligonucleotide-based kits, the kit can comprise, 
5 for exampler (i) an oligonucleotide (e.y., a detectably ' 
labeled oligonucleotide) which hybridizes to a nucleic acid 
sequence encoding a polypeptide of the invention, or (2) a 
pair of primers, such as the primers recited in Table l, 
useful for amplifying a nucleic acid molecule encoding a 
polypeptide of the invention. 

10 

The kit can also comprise, for example, one or more 
buffering agents, preservatives, or protein stabilizing 
agents. The kit can also comprise components necessary for 
detecting the detectable agent (e.g., an enzyme or a 
substrate). The kit can also contain a control sample or a 
series of control samples which can be assayed and compared 
to the test sample. Each component of the kit is usually 
enclosed within an individual container and all of the 
various containers are within a single package along with 
instructions for observing whether the tested subject is 
suffering from or is at risk of developing a disorder 
2 J, associated with polymorphisms that correlate with alleles 
that cause HJOTGl-related disorders, and/or aberrant levels of 
HKNGl mRNA, polypeptides or activity. 

PgPECTION OP BITKai KUCLBTG ^IQID MOI.ECnr.Bfi 

A variety of methods can be employed to screen for the 
presence of HKNGl gene-specific mutations or polymorphisms 
(including polymorphisms flanking HKNGl gene) and to detect 
and/or assay levels of HKNGl nucleic acid sequences. 

Mutations or polymorphisms within or flanking the HKNGl 
gene can be detected by utilizing a number of techniques. 
30 Miicleic acid from any nucleated cell can be used as the 

starting point for such assay techniques, and may be isolated 



25 
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according to standard nucleic acid preparation procedures 
that are well known to those of skill in the art. 

EXNGl nucleic acid sequences may be used in 
hybridization or amplification assays of biological saj^les 
^ to detect abnormalities involving HKNGl gene structure, 

including point mutations, insertions, deletions, inversions 
translocations and chromosomal rearrangements. Such assays ' 
may include, but are not limited to. Southern analyses, 
single-stranded conformational polymorphism analyses (SSCP) , 
and PGR analyses. ' 

Diagnostic methods for the detection of HKNGl gene- 
specific mutations or polymorphisms can involve for example, 
contacting and incubating nucleic acids obtained from a 
sample, e.g., derived from a patient sample or other 
appropriate cellular source with one or more labeled nucleic 
acid reagents including recombinant DNA molecules, cloned 
15 genes or degenerate variants thereof, such as described in 
Section 5.1, above, under conditions favorable for the 
specific annealing of these reagents to their complementary 
sequences within or flanking the HKNGl gene. The diagnostic 
methods of the present invention further encompass contacting 
and incubating nucleic acids for the detection of single 
nucleotide mutations or polymorphisms of the HKNGl gene. 
Preferably, these nucleic acid reagent sequences within the 
HKNGl gene, or chromosome I8p nucleotide sequences flanking 
the HKNGl gene are 15 to 30 nucleotides in length. 

After incubation, all non-annealed nucleic acids are 
25 removed from the nucleic acidzHKNGl molecule hybrid. The 
presence of nucleic acids that have hybridized, if any such 
molecules exist, is then detected. Using such a detection 
scheme, the nucleic acid from the cell type or tissue of 
interest can be immobilized, for example, to a solid support 
- such as a membrane, or a plastic surf ace. such as that on a 
30 microtiter plate or polystyrene beads. In this case, after 
incubation, non-annealed, labeled nucleic acid reagents of 
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10 



the type described in Section 5.1 are easily removed. 
Detection of the remaining, annealed, labeled HKNGl nucleic 
acid reagents is accomplished using standard techniques well- 
known to those in the art. The EKNGl gene sequences to which 
5 the nucleic acid reagents have annealed can be compared to 
the annealing pattern expected from a normal HKNGl gene 
sequence in order to determine whether a ffirWGI gene mutation 
is present. 

In a preferred embodiment, HKNGl mutations or 
polymorphisms can be detected by using a microassay of HKHGl 
nucleic acid sequences immobilized to a substrate or "gene 
Chip" (see, e.g. Cronin, et al., 1996, Human Mutation 7:244- 
255). 

Alternative diagnostic methods for the detection of 
HKNGl gene-specific nucleic acid molecules (or HKNGl flanking 

15 sequences) , in patient samples or other appropriate cell 
sources, may. involve their amplification, e.g., by PGR (the 
experimental embodiment set forth in Mullis, 1987, U.S. 
Patent No. 4,683,202), followed by the analysis of the 
amplified molecules using techniques well known to those of 
skill in the art, such as, for example, those listed above 

20 The resulting amplified sequences can be compared to those 
that would be expected if the nucleic acid being amplified 
contained only normal copies of the HKNGl gene in order to 
determine whether ^ HKNGl gene mutation or polymorphism in 
linkage disequilibrium with a disease-causing HKNGl allele 
25 exists. 

Among those HKNGl nucleic acid sequences which are 
preferred for such amplification-related diagnostic screening 
analyses are oligonucleotide primers which amplify HKNGl exon 
sequences. The sequences of such oligonucleotide primers 
- are, therefore, preferably derived from HKNGl intron 
30 sequences so that the entire exon, or coding region, can be 
analyzed as discussed below. Primer pairs useful for 
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amplification of HKNGl exons are preferably derived from 
adjacent introns. Appropriate primer pairs can be chosen 
such that each of the eleven EKNGl exons are amplified. 
Primers for the amplification of HKNGl exons can be routinely 
5 designed by one of ordinary skill in the art by utilizing the 
exon and intron sequences of HKNGl shown in Figure 3A-3R. 

As an example, and not by way of limitation. Table i 
below, lists primers and primer pairs which can be utilized 
for the amplification of each of the human HKGNl exons one 
through eleven, m this table, a primer pair is listed for 
each exon which consists of a forward primer derived from 
intron sequence upstream of the exon to be amplified, and a 
reverse primer derived from intron sequence downstream of the 
exon to be amplified. For exons greater than about 300 base 
pairs in length, exons 4 and 7, two primer pairs are 

listed (marked 4a, 4b, 7a and 7b) . Each of the primer pairs 
can be utilized, therefore, as part of a standard PCR 
reaction to amplify an individual HKNGl exon (or portion 
thereof). Primer sequences are depicted in a 5' to 3« 
orientation. 



20 



25 



30 
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1 " 

1 


1 Primer Seouence 1 


1 ^ 


^-^t-^cigT:;T:T:carggac (SEQ ID NO; 28) 


forward 




caaggtatacacgtcftcatttc fSEQ ID N0:29^ 


reverse 








10 


gaatgtgtattgggatttagtaaac (SEQ ID NO: 301 


forwcLcd 




ttgagaattaactattcctgtcaac (SEQ ID NO; 311 


reverse 


11 


ccatcctggacttttactcc (SEO ID NO: 321 
_ctttcctgcaactgtgtttatta fSEO ID NO:33^ 


forward 
reverse 



10 



Each primer pair above can be used to generate an 
amplified sequence of about 300 base pairs. This is 
especially desirable in instances in which sequence analysis 
is performed using SSCP gel electrophoretic procedures, in 
that such procedures work optimally using sequences of about 
25 300 base pairs or less. 

Additional SKNGl nucleic acid sequences which are 
preferred for such amplification-related analyses are those 
Which will detect the presence of an HKNGl polymorphism which 
differs from the SKNGl sequence depicted in FIG. 3A-3R. such 
polymorphisms include ones which represent mutations 
associated with an ^JWGl-mediated neuropsychiatric disorder, 
such as BAD or schizophrenia. For example, a single base 
mutation identified in the Example presented in Section 8 
below, results in a mutant HKNGl gene product comprising 
substitution Of a lysine residue for the wild-type glutamic 
25 acid residue at amino acid position 202 of the BKNGl amino 
acid sequence shown in pig. ia-1b (seq id NO; 2) or amino acid 
position 184 of the HKNGl amino acid sequence shown in FIG. 
2A-2B (SEQ ID N0:4) . Such polymorphisms also include ones 
that correlate with the presence of a HKNGl-mediated 
neuropsychiatric disorder, e.g., polymorphisms that are in 
linkage disequilibrium with disorder-causing HKNGl alleles. 
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Amplification techniques are well known to those of 
skill in the art and can routinely be utilized in connection 
with primers such as those listed in Table 1 above. In 
general, hybridization conditions can be as follows. 
In general, for probes between 14 and 70 nucleotides in 
^ length the melting temperature TM is calculated using the 
formula : Tm ( ^^C) =8 1 • 5+16 . 6 ( log [ monovalent cations ) ) +0. 4 1 ( % 
G+C)-(500/lI) where N is the length of the probe. If the 
hybridization is carried out in a solution containing 
formamide, the melting temperature is calculated using the 
equation Tm(°C)=81.5+16.6 (log[monovalent cations] )+0. 41 (% 
^0 G+C)-(0.61% formamide) -{500/N) where N is the length of the 
probe . 

Additionally, well-known genotyping techniques can be 
performed to identify individuals carrying HKNGl gene 
mutations. Such techniques include, for example, the use of 

15 ^^^^^^^'^^^^^ fragment length polymorphisms (RFLPs) , which 
involve sequence variations in one of the recognition sites 
for the specific restriction enzyme used. 

Further, improved methods for analyzing DNA 
polymorphisms, which can be utilized for the identification 
of EKNGl gene-specific mutations, have been described that 

20 ^^Pi'*^^^i2e on the presence of variable numbers of short, 
tandemly repeated DNA sequences between the restriction 
enzyme sites. For example, Weber (U.S. Pat. No. 5,075,217) 
describes a DNA marker based on length polymorphisms in 
blocks of (dC-dA)n-(dG-dT)n short tandem repeats. The 
average separation of (dC-dA) n- (dG-dT) n blocks is estimated 
to be 30,000-60,000 bp. Markers that are so closely spaced 
eschibit a high frequency co-inheritemce, and are extremely 
useful in the identification of genetic mutations, such as, 
for example, mutations within the HKNGl gene, and the 
diagnosis of diseases and disorders related to HKNGl 

' mutations. 

to Also, caskey et al. (U.S. Pat. No. 5,364,759) describe a 

DNA profiling assay for detecting short tri and tetra 
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nucleotide repeat sequences. The process includes extracting 
the DNA of interest, such as the HKNGl gene, amplifying the 
extracted DNA, and labelling the repeat sequences to form a 
genotypic map of the individual's DNA. 
^ Other methods well known in the art may be used to 

identify single nucleotide polymorphisms (SNPs) , including 
biallelic SNPs or biallelic markers which have two alleles, 
both of which are present at a fairly high frequency in a 
population. Conventional techniques for detecting SNPs 
include, e.g., conventional dot blot analysis, single 
stranded conformational polymorphism (SSCP) analysis (see, 
e.g., Orita et al., 1989, j>roc. Natl. Acad, Sci. aSA fl5:2766- 
2770), denaturing gradient gel electrophoresis (DGGE) , 
heterodulex analysis, mismatch cleavage detection, and other 
routine techniques well known in the art (see, e.g., 
Sheffield et al., 1989, Proc. Natl. Acad. Sci. 86:5855-5892; 
" Grompe, 1993, Nature Genetics 5: 111-117). Alternative, 
preferred methods of detecting and mapping SNPs involve 
microsequencing techniques wherein an SNP site in a target 
DNA is detecting by a single nucleotide primer extension 
reaction (see, e.g., Goelet et al., PCT Publication No. 
W092/15712; Mundy, U.S. Patent Ho. 4,656,127; Vary and 
Diamond, U.S. Patent No. 4,851,331; Cohen et al., PCT 
Publication No. WO91/02087; Chee et al., PCT iniblication No. 
W095/11995; Landegren et al. , 1988, Science 241:1077-1080; 
Nicerson et al., 1990, J>roc. Natl. Acad. Sci. U.S.A. 8728923- 
8927; Pastinen et al., 1997, Genome J?es. 7:606-614; Pastinen 
et al., 1996, Clin. Cbem. 42:1291-1397 f Jalanko et al., 1992, 
Clin. Chem. 38:39-43; Shumaker et al., 1996, Hum. Wutation 
7:346-354; Caskey et al., PCT Publication No. wo 95/00669). 

The level of EKNGl gene expression can also be assayed. 
For example, PNA from a cell type or tissue known, or 
suspected, to express the HXNGl gene, such as brain, may be 
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isolated and tested utilizing hybridization or PGR techniques 
such as are described, above. The isolated cells can be 
derived from cell culture or from a patient. The analysis of 
cells taken from culture may be a necessary step in the 
assessment of cells to be used as part of a cell-based gene 
5 therapy technique or, alternatively, to test the effect of 
compounds on the expression of the HKNGl gene. Such analyses 
may reveal both quantitative and qualitative aspects of the 
expression pattern of the HKNGl gene, including activation or 
inactivation of HKNGl gene expression. 
10 In one embodiment of such a detection scheme, a cDNA 

molecule is synthesized from an RNA molecule of interest 
(e.g., by reverse transcription of the RNA molecule into 
cDNA) . A sequence within the cDNA is then used as the 
template for a nucleic acid amplification reaction, such as a 
PGR amplification reaction, or the like. The nucleic acid 
15 reagents used as synthesis initiation reagents (e.g., 
primers) in the reverse transcription and nucleic acid 
amplification steps of this method are chosen from among the 
HKNGl gene nucleic acid reagents described in Section 5.1. 
The preferred lengths of such nucleic acid reagents are at 
20 ^^^^ nucleotides. For detection of the amplified 

product, the nucleic acid amplification may be performed 
using radioactively or non-radioactively labeled nucleotides. 
Alternatively, enough amplified product may be made such that 
the product may be visualized by standard ethidium bromide 
staining or by utilizing any other suitable nucleic acid 
staining method. 

25 

Additionally, it is possible to perform such HKNGl gene 
expression assays "in situ«. I.e., directly upon tissue 
siections (fixed and/or frozen) of patient tissue obtained 
from biopsies or resections, such that no nucleic acid 
purification is necessary. Nucleic acid reagents such as 
30 those described in Section 5.1 may be used as probes and/ or 
primers for such in situ procedxires (see, for example, Nuovo, 
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G.J., 1992, »PCR In Situ Hybridization: Protocols And 
Applications", Raven Press, NY). 

Alternatively, if a sufficient quantity of the 
appropriate cells can be obtained, standard Northern analysis 
can be performed to determine the level of mRNA expression of 
* the HKNGl gene. 



5.7. DETECTIO W QP mgjai geKE PROnnCTC 
Antibodies directed against unimpaired or mutant HKNGl 
gene products or conserved variants or peptide fragments 
10 thereof, which are discussed, above, in Section 5.3, may also 
be used as diagnostics and prognostics for a HKWGI-mediated 
disorder, a neuropsychiatric disorder such as BAD or 

schizophrenia. Such methods may be used to detect 
abnormalities in the level of HKNGl gene product synthesis or 
expression, or abnormalities in the structure, temporal 
15 expression, and/or physical location of HKNGl gene product. 
The antibodies and immunoassay methods described herein have, 
for example, important in vitro applications in assessing the 
efficacy of treatments for ffiOTGl -mediated disorders. 
Antibodies, or fragments of antibodies, such as those 
20 described below, may be used to screen potentially 

therapeutic compounds xn vitro to determine their effects on 
HKNGl gene expression and HKNGl gene product production. The 
compounds that have beneficial effects on a -mediated 
disorder, such as BAD or schizophrenia. 

In vitro immunoassays may also be used, for example, to 
assess the efficacy of cell-based gene therapy for a HKNGl- 
mediated disorder, e^, a neuropsychiatric disorder, such as 
BAD schizophrenia. Antibodies directed against HKNGl gene 
products may be used in vitro to determine, for example, the 
level of HKNGl gene expression achieved in cells genetically 
engineered to produce HKNGl g ne product. In the case of 
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intracellular HKNGl gene products, such an assessment is 
done, preferably, using cell lysates or extracts. Such 
analysis will allow for a determination of the number of 
transformed cells necessary to achieve therapeutic efficacy 
^ in vavo, as well as optimization of the gene replacement 
protocol . 

Itoe tissue or cell type to be analyzed will generally 
include those that are known, or suspected, to express the 
HKNGl gene. The protein isolation methods employed herein 
may, for example, be such as those described in Harlow and 

10 Lane (1988, -Antibodies: A Laboratory Manual", Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, New York) . The 
isolated cells can be derived from cell culture or from a 
patient. The analysis of cells taken from culture may be a 
necessary step in the assessment of cells to be used as r,art 
Of a cell-based gene therapy technique or, alternatively', to 

15 test the effect of compounds on the expression of the HKNGl 
gene. 

Preferred diagnostic methods for the detection of HKNGl 
gene products, conserved variants or peptide fragments 
thereof, may involve, for example, immunoassays wherein the 
HKNGl gene products or conserved variants or peptide 
fragments are detected by their interaction with an anti- 
BKNGl gene product-specific antibody. 

Por example, antibodies, or fragments of antibodies 
such as those described, above, in Section 5.3, may be used 
to quantitatively or qualitatively detect the presence of 
25 HKNGl gene products or conserved variants or peptide 

fragments thereof. This can be accomplished, for example, by 
iamunofluorescence techniques employing a fluorescently 
labeled antibody (see below, this Section) coupled with light 
microscopic, flow cytometric, or fluorimetric detection, 
such techniques are especially preferred for HKNGl gene. 
30 products that are expressed on the cell surface. 
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The antibodies (or fragments thereof) useful in the 
present invention may, additionally, be employed 
histologically, as in immunofluorescence or immunoelectron 
microscopy, for in situ detection of HKNGl gene products, 
conserved Vcuriemts or peptide fragments thereof. In situ 

5 

detection may be accomplished by removing a histological 
specimen from a patient, and applying thereto a labeled 
antibody that binds to an rTs polypeptide. The antibody (or 
fragment) is preferably applied by overlaying the labeled 
antibody (or fragment) onto a biological sample. Through the 
use of such a procedure, it is possible to determine not only 
the presence of the HKNGl gene product, conserved vetriants or 
peptide fragments, but also its distribution in the examined 
tissue. Using the present invention, those of ordinary skill 
will readily recognize that any of a wide variety of 
histological methods (such as staining procedures) can be 

JL5 modified in order to achieve in situ detection of a HKNGl 
gene product. 

Immunoassays for HKNGl gene products, conserved 
varicuits, or peptide fragments thereof will typically 
comprise incubating a sample, such as a biological fluid, a 
tissue extract, freshly h£urvested cells, or lysates of cells 

20 in the presence of a detectzdaly labeled antibody capable of 
identifying HKNGl gene product, conserved variants or peptide 
fragments thereof, and detecting the bound antibody by amy of 
a nimber of techniques well-3cnown in the art. 

The biological sample may be brought in contact with and 
immobilized onto a solid phase support or ccirrier, such as 

25 

nitrocellulose, that is cap£a>le of immobilizing cells, cell 
partiicles or soluble proteins. The support may then be 
washed with suitable buffers followed by treatment with the 
detectably labeled HKNGl gene product specific antibody. The 
solid phase support may then be washed with the buffer a 
second time to remove xinbound emtibody. The amount of bound 
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label on the solid support may then be detected by 
conventional means. 

By "solid phase support or carrier" is intended any 
support capable of binding an antigen or an antibody. Well- 
known supports or carriers include glass, polystyrene, 
5 polypropylene, polyethylene, dextran, nylon, amylases, 
natural and modified celluloses, polyacrylamidss , gabbros, 
and magnetite. The nature of the carrier can be either 
soluble to some extent or insoluble for the purposes of the 
present invention. The support material may have virtually 
any possible structural configuration so long as the coupled 
10 molecule is capable of binding to an antigen or antibody. 
Thus, the support configuration may be spherical, as in a 
bead, or cylindrical, as in the inside surface of a test 
tube, or the external surface of a rod. Alternatively, the 
surface may be flat such as a sheet, test strip, etc. 
Preferred supports include polystyrene beads. Those skilled 
15 in the art will know many other suitable carriers for binding 
antibody or antigen, or will be able to ascertain the same by 
use of routine experimentation. 

One of the ways in which the HKNGl gene product-specific 
antibody can be detectably labeled is by linking the same to 
20 ®^zyme, such as for use in an enzyme immunoassay (EIA) 

(Voller, A., "The Enzyme Linked Immunosorbent Assay (ELISA)", 
1978, Diagnostic Horizons 2:1-7, Microbiological Associates " 
Quarterly Publication, WalkersviUe, MD) ; Voller, A. et aJ., 
1978, J. Clin. Pathol. 31j507-520; Butler, J.E., 1981, Meth. 
Enzymol. 73:482-523; Maggio, E. (ed.), 1980, Enzyme 
25 Immunoassay, CRC Press, Boca Raton, FL,; ishikawa, E. et al., 
(eds.), 1981, Enzyme Immunoassay, Kgaku Shoin, Tokyo). The 
enzyme which is bound to the antibody will react with an 
appropriate substrate, preferably a chromogenic substrate, in 
such a maimer as to produce a chemical moiety that can be 
- detected, for example, by spcctrophotometric , fluorimetric or 
30 by visual means. Enzymes that can be used to detectably 
label the antibody include, but are not limited to, malate 
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dehydrogenase, staphylococcal nuclease, delta-5-steroid 
isonerase, yeast alcohol dehydrogenase, a-glycerophosphate, 
dehydrogenase, triose phosphate isomerase, horseradish 
peroxidase, alkaline phosphatase, asparaginase, glucose 
oxidase, 3-galactosidase, ribonuclease, urease, catalase, 
5 glucose-6-phosphate dehydrogenase, glucoamylase and 

acetylcholinesterase. The detection can be accomplished by 
color imetric methods that employ a chromogenic substrate for 
the enzyme. Detection may also be accomplished by visual 
comparison of the extent of enzymatic reaction of a substrate 
in comparison with similarly prepared standards. 

Detection may also be accomplished using any of a 
variety of other immunoassays. For example, by radioactively 
labeling the antibodies or antibody fragments, it is possible 
to detect HKNGl gene products through the use of a 
radioimmunoassay (RIA) (see, for example, Weintraub, B., 
Principles of Radioimmunoassays, Seventh Training Course on 
Radioligand Assay Techniques, The Endocrine Society, March, 
1986) . The radioactive isotope can be detected by such means 
as the use of a gamma counter or a scintillation counter or 
by autoradiography. 

It is also possible to label the antibody with a 
fluorescent compound. When the f luorescently labeled 
antibody is exposed to light of the proper wave length, its 
presence can then be detected due to fluorescence. Among the 
most commonly used fluorescent labeling compounds are 
fluorescein isothiocyanate , rhodamine, phycoerythrin, 
phycocyanin, allophycocyanin, o-phthaldehyde and 
2g f luorescamine. 

The antibody can also be detectably labeled using 
fluorescence emitting metals such as "^Eu, or others of the 
lanthanide series. These metals can be attached to the 
antibody using such metal chelating groups as 
diethylenetriaminepentacetic acid (DTPA) or 
30 ^^^ylenediamin tetraacetic acid (EDTA) . 
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The antibody also can be detectably labeled by coupling 
It to a chemilufflinescent compound. The presence of the 
chemilminescent-tagged antibody is then determined by 
detecting the presence of luminescence that arises during the 
course of a chemical reaction. Examples of particularly 
useful chemiluminescent labeling compounds are luminol 
isoluminol, theromatic acridinium ester, imidazole, 
acridinium salt and oxalate ester. 

Likewise, a bioluminescent compound may be used to label 
the antibody of the present invention. Bioluminescence is a 
type Of chemiluminescence found in biological systems in 
Which a catalytic protein increases the efficiency of the 
Chemiluminescent reaction. The presence of a bioluminescent 
protein is determined by detecting the presence of 
luminescence. Important bioluminescent compounds for 
purposes of labeling are luciferin, luciferase and aequorin. 

5.8. SCREENIKG ASSAYS POR CQMPODNDS 

THAT MOnntATE mrww? gEMB AOTTirrn.v 

The following assays are designed to identify compounds 
that bind to a SKNGl gene product, compounds that bind to 
proteins, or portions of proteins that interact with a HKNGl 

20 gene product, compounds that modulate, e.g., interfere with, 
the interaction of a HKNGl gene product with proteins and 
compounds that modulate the activity of the SKNGl gene (i.e., 
modulate the level of HKNGl gene expression and/or modulate ' 
the level of HKNGl gene product activity) . Assays may 

„ additionally be utilized that identify compounds that bind to 
BKNGl gene regulatory sequences (e.g., promoter sequences; 
see e.g., Piatt, 1994, J. Biol. Chem. 269, 28558-28562), and 
that can modulate the level of HKNGl gene expression. Such 
compounds may include, but are not limited to, small organic 
molecules, such as ones that are able to cross the blood- 
brain barrier, gain to and/or entry into an appropriate cell 
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and affect expression of the HKNGl gene or some other gene 
involved in a HKNGl regulatory pathway. 

Methods for the identification of such proteins are 
described, below, in Section 5.8.2. such proteins may be 
5 involved in the control and/or regulation of mood. Further 
among these compounds are compounds that affect the level of 
HKNGl gene expression and/or SKNGl gene product activity and 
that can be used in the therapeutic treatment of HKNGl- 
»ediated disorders, neuropsychiatric disorders such as 

BAD and schizophrenia as described, below, in Section 5.9. 
10 compounds may include, but are not limited to, peptides 

such as, for example, soluble peptides, including but not 
Umited to, lg>tailed fusion peptides, and members of random 
peptide libraries; (see, e.g.. Lam, et al., 1991, Mature 
354:82-84; Houghten, et al., 1991, Nature 354:84-86), and 
combinatorial chemistry-derived molecular library made of D- 
and/or L- configuration amino acids, phosphopeptides 
(including, but not limited to members of random or partially 
degenerate, directed phosphopeptide libraries; see, e.g., 
Songyang, et al., 1993, cell 72:767-778), antibodies 
(including, but not limited to, polyclonal, monoclonal, 
20 humanized, anti-idiotypic, chimeric or single chain 
antibodies, and PAb, F(ab'), and PAb expression library 
fragments, and epitope-binding fragments thereof) , and small 

organic or inorganic molecules. 

Such compounds may further comprise compounds, in 
particular drugs or members of classes or families of drugs 
25 known to ameliorate the symptoms of a HlWGl -mediated 

disorder, e^, a neuropsychiatric disorder such as BAD or 
schizophrenia . 

such compounds include families of antidepressants such 
as lithium salts, carbamazepine, valproic acid, lysergic acid 
-diethylamide (LSD), p-chlorophenylalanine, p- 
30 propyldopacetamide dithiocarbamate derivatives e.g., pla 63; 
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anti-anxiety drugs, e.g., diazepam; monoamine oxidase (MAO) 
inhibitors, e.gr., iproniazid, clorgyline, phenelzine and 
isocarboxazid; biogenic amine uptake blockers, e.g., 
tricyclic antidepressants such as desipramine, imipramine and 
5 amitriptyline; serotonin reuptake inhibitors e.g., 
fluoxetine; antipsychotic drugs such as phenothiazine 
derivatives (e.g., chlorpromazine (thorazine) and 
trif luopromazine) ) , butyrophenones (e.g., haloperidol 
(Haldol)), thioxanthene derivatives (e.g., chlorprothixene) , 
and dibenzodiazepines (e.g., clozapine); benzodiazepines; 
dopaminergic agonists and antagonists e.g., l-dopa, cocaine, 
amphetamine, a-methyl-tyrosine, reserpine, tetrabenazine, 
benzotropine, pargyline; noradrenergic agonists and 
antagonists e.g., clonidine, phenoxybenzamine, phentolamine , 
tropolone . 

" compounds identified via assays such as those described 

herein may be useful, for example, in elaborating the 
biological function of the SKNGl gene product and for 
ameliorating IHWGI -mediated neuropsychiatric disorders, such 
as BAD and schizophrenia. Assays for testing the 

2g effectiveness of compounds identified by, for example, 

techniques such as those described in Sections 5.8.1 - 5.8.3, 
are discussed, below, in Section 5.8.4. 

5.8.1. XW VITRO SCRBEHIMG ASSAYS PGR COHPOOMDS 
THAT Bum TO WES snrai sehe PHonnCT 

25 Jn vitro systems may be designed to identify compounds 

capable of binding the EXNGl gene products of the invention. 
Compounds identified may be useful, for example, in 
modulating the activity of unimpaired and/or mutant BKNGl 
gene products, may be useful in elaborating the biological 
function of the HKNGl gene product, may be utilized in 
screens for identifying compounds that disrupt normal SKNGl 
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gene product interactions, or may in themselves disrupt such 
interactions. 

The principle of the assays used to identify compounds 
that bind to the HKNGl gene product involves preparing a 
^ reaction mixture of the HKNGl gene product and the test 
compound under conditions and for a time sufficient to allow 
the two components to interact and bind, thus forming a 
complex that can be removed and/ or detected in the reaction 
mixture. These assays can be conducted in a variety of ways. 
For example, one method to conduct such an assay involves 
anchoring a HKNGl gene product or a test substance onto a 
solid support and detecting HKNGl gene product/test compound 
complexes formed on the solid support at the end of the 
reaction. In one embodiment of such a method, the HKNGl gene 
product may be anchored onto a solid support, and the test 
compound, which is not anchored, may be labeled, either 

■5 directly or indirectly. 

In practice, microtiter plates are conveniently 
utilized as the solid support. The anchored component may be 
immobilized by non-covalent or covalent attachments. Non- 
covalent attachment may be accomplished by simply coating the 
solid surface with a solution of the protein and drying. 

® Alternatively, an immobilized cintibody, preferably a 
monoclonal antibody, specific for the protein to be 
immobilized may be used to anchor the protein to the solid 
surface. The surfaces may be prepared in advance and stored. 

In order to conduct the assay, the non- immobilized 
component is added to the coated surface containing the 

5 anchored component. After the reaction is complete, 

unreacted components are removed (e.g., by washing) under 
conditions such that any complexes formed will remain 
immobilized on the solid surface. The detection of complexes 

. anchored on the solid surface can be accomplished in a number 
of ways. Where the previously non- immobilized component is 

0 

pre-labeled, the detection of led>el immobilized on the 
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surface indicates that complexes were formed. Where the. 
previously non-immobilized component is not pre-labeled, an 
indirect label can be used to detect complexes anchored on 
the surface; e.g^, using a labeled antibody specific for the 
^ previously non-immobilized component (the antibody, in turn, 
may be directly labeled or indirectly labeled with a labeled 
anti-Ig antibody) . 

Alternatively, a reaction can be conducted in a liquid 
phase, the reaction products sepearated from xinreacted 
components, and complexes detected; e.gr. , using an 
immobilized antibody specif ic for HKNGl gene product or the 
test compound to anchor any complexes formed in solution, and 
a -labeled antibody specific for the other component of the 
possible complex to detect anchored complexes. 

5.8.2 • ASSAYS FOR PROTEINS THAT IKTERACT 
WITH EKNGl GENE PRODITCTS 

Any method suitable for detecting protein-protein 
interactions may be employed for identifying HKNGl gene 
product-protein interactions. 

Among the traditional methods that may be employed Eire 
co-immunoprecipitation, cross-linking and co-purification 

20 through gradients or chromatographic columns. Utilizing 
procedvures such as these allows for the identification of 
proteins, including intracellular proteins, that interact 
with HKNGl gene products. Once isolated, such a protein can 
be identified and oan be used in conjunction with standard 
techniques, to identify proteins it interacts with. For 

25 example, at least a portion of the cimino acid sequence of a 
protein that interacts with the HKNGl gene product can be 
ascertained using techniques well known to those of skill in 
the art, such as via the Edman degradation technique (see, 
_ e.g., Creighton, 1983, "Proteins: Structures and Molecular 

30 P^ii^^ipl^s," W.H. Freeman & Co., N.Y., pp. 34-49). The amino 
acid sequence obtained may be used as a guide for the 
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generation of oligonucleotide mixtures that can be used to 
screen for gene sequences encoding such proteins. Screening 
made be accomplished, for example, by standard hybridization 
or PGR techniques. Techniques for the generation of 
^ oligonucleotide mixtures and the screening are vell-known. 
(See, e.g., Ausubel, supra, and 1990, "PCR Protocols: A 
Guide to Methods and Applications," innis, et al., eds. 
Academic Press, inc.. New York). 

Additionally, methods may be employed that result in the 
simultaneous identification of genes that encode a protein 
which interacts with a HKNGl gene product. These methods 
include, for example, probing expression libraries with 
labeled SKNGl gene product, using EKNGl gene product in a 
manner similar to the well known technique of antibody 
probing of Xgtll libraries. 

one method that detects protein interactions in vivo, 
^5 the two-hybrid system, is described in detail for 

illustration only and not by way of limitation. One version 
Of this system has been described (caiien, et aJ., 1991, Proc. 
Natl. Acad. sci. USA, 88:9578-9582) and is commercially 
available from Clontech (Palo Alto, CA) . 
20 Briefly, utilizing such a system, plasmids are 

constructed that encode two hybrid proteins: one consists of 
the DNA-binding domain of a transcription activator protein 
fused to the SXNGl gene product and the other consists of the 
transcription activator protein's activation domain fused to 
an unknown protein that is encoded by a cDNA that has been 
25 recombined into this plasmid as part of a cDNA library. The 
DNA-binding domain fusion plasmid and the cDNA library are 
transformed into a strain of the yeast Saccharomyces 
cerevisiae that contains a reporter gene (e.g., hbs or lacZ) 
whose regulatory region contains the transcription 
. activator's binding site. Either hybrid protein alone cannot 
30 activate transcription of the reporter gene: the DNA-binding 
domain hybrid cannot because it does not provide activation 
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function and the activation domain hybrid cannot because it 
cannot localize to the activator's binding sites. 
Interaction of the two hybrid proteins reconstitutes the 
functional activator protein and results in expression of the 
reporter gene, which is detected by an assay for the reporter 
^ gene product. 

The two-hybrid system or related methodologies may be 
used to screen activation domain libraries for proteins that 
interact with the "bait" gene product. By way of example, 
and not by way of limitation, HENGl gene products may be used 

10 ^® ^^^^ product. Total genomic or cDNA sequences 

are fused to the DNA encoding an activation domain. This 
library and a plasmid encoding a hybrid of a bait HKNGl gene 
product fused to the DNA-binding domain are co-transformed 
into a yeast reporter strain, and the resulting transformants 
are screened for those that express the reporter gene. Por 
example, a bait HKNGl gene sequence, such as the open reading 
frame of the HKNGl gene, can be cloned into a vector such 
that it is translationally fused to the DNA encoding the DNA- 
binding domain of the GAL4 protein. These colonies are 
purified and the libreury plasmids responsible for reporter 
gene expression are isolated. DNA sequencing is then used to 

20 identify the proteins encoded by the libr£iry plasmids. 

A CDNA library of the cell line from which proteins that 
interact with bait HKNGl gene product are to be detected can 
be made using methods routinely practiced in the art. 
According to the particular system described herein, for 
example, the cDNA fragments can be inserted into a vector 

25 such that they eure trsmslationally fused to the 

transcriptional activation domain of GAL4. Such a library 
can be co-transformed along with the bait HKNGl gene-GAL4 
fusion plasmid into a yeast strain that contains a lacZ gene 
driven by a promoter that contains GAL4 activation sequence. 
A CDNA encoded protein, fused to a GAIi4 tremscriptional 
activation domain that interacts with bait HKNGl gene product 
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will reconstitute an active GAL4 protein and thereby drive 
expressxon Of the HIS3 gene, colonies that express „XS3 can 
be detected hy their growth on petri dishes containing sel- 
solxd agar based .edia lacking histidine. The cDNA can then 
^ be purafxed fron these strains, and used to produce and 

xsolate the bait EKNGl gene product-interacting protein using 
techniques routinely practiced in the art. 

COMPODMDS TH^T IllTERPERE WITH OR 
?S:^Sxok"^^' M.CROHO.BCOLB 
The HKNGi gene products Bay, in ^rivo, interact with one 
or more macromolecules , including intracellular 

Inclurrf -cromolecules .ay 
include, but are not limited to, nucleic acid molecules and 
those protexns identified via methods such as those 

thxs dxscussxon, the macromolecules are referred to herein as 

bindxng partners-. Compounds that disrupt HKNGl gene 
product binding to a binding partner may be useful in 
regulating the activity of the HKNGl gene product, especially 
mutant HKNGl gene products. Such compounds may include, but 

" TdelLi^^'l "'^"^'"^ 

as descrxbed, for example, in Section 5.8.2 above. 

The basic principle of an assay system used to identify 
expounds that interfere with or potentiate the interaction 
between the HKNGl gene product and a binding partner or 
25 preparing a reaction mixture containing the 

EKNGl gene product and the binding partner under conditions 
and for a time sufficient to allow the two to interact and 

fri«rr/°"''' ' ^° ^^^^ - --p-»^ 

for inhibitory activity, the reaction mixture is prepared in 
the presence and absence of the test compound. The test 
^- compound may be initially included in the reaction mixture 
or may be added at a time subsequent to the addition of HKNGl 
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gene product and its binding. partner. Control reaction 
mixtures are incubated without the test compound or with a 
compound which is known not to block complex formation. The 
formation of any complexes between the EENGl gene product and 
^ the binding partner is then detected. The formation of a 
complex in the control reaction, but not in the reaction 
mixture containing the test compound, indicates that the 
compound interferes with the interaction of the HKNGl gene 
product and the binding partner. Additionally, complex 
formation within reaction mixtures containing the test 
compound and normal HKNGl gene product may also be compared 
to cojnplex formation within reaction mixtures containing the 
test compound and a mutant SKNGl gene product. This 
comparison may be important in those cases wherein it is 
desirable to identify compounds that disrupt interactions of 
mutant but not normal HKNGl gene product. 
*5 in order to test a compound for potentiating activity 

the reaction mixture is prepared in the presence and absence 
of the test compound. The test compound may be initially 
included in the reaction mixture, or may be added at a time 
subsequent to the addition of HKNGl gene product and its 
binding partner. Control reaction mixtures are incubated 
without the test compound or with a compound which is known 
not to block complex formation. The formation of any 
complexes between the HKNGl gene product and the binding 
partner is then detected. Increased formation of a complex 
in the reaction mixture containing the test compound, but not 
„ xn the control reaction, indicates that the compound enhances 
and therefore potentiates the interaction of the HKNGl gene 
product and the binding partner. Additionally, complex 
formation within reaction mixtures containing the test 
compound and normal HKNGl gene product may also be compared 
.to complex formation within reaction mixtures containing the 
30 test compound and a mutant HKNGl gene product. This 
comparison may be important in those cases wherein it is 
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desirable to identify compounds that enhance interactions of 
mutant but not normal HKNGl gene product. 

in alternative embodiments, the above assays may be 
performed using a reaction mixture containing the HKNGl gene 
5 product, a binding partner, and a third which disrupts or 
enhances HKNGl gene product binding to the binding partner. 
The reaction mixture is prepared and incubated in the 
presence and absence of the test compound, as described 
above, and the formation of any complexes between the HKNGl 
gene product and the binding partner is detected, m this 
10 embodiment, the formation of a complex in the reaction 

mixture containing the test compound, but not in the control 
reaction, indicates that the test compound interferes with 
the ability of the second compound to disrupt HKNGl gene 
product binding to its binding partner. 

The assays for compounds that interfere with or 
potentiate the interaction of the HKNGl gene products and 
binding partners can be conducted in a heterogeneous or 
homogeneous format. Heterogeneous assays involve anchoring 
either the HKNGl gene product or the binding partner onto a 
solid support and detecting complexes formed on the solid 
20 support at the end of the reaction. In hoa.ogeneous assays 
the entire reaction is carried out in a liquid phase, in ' 
either approach, the order of addition of reactants can be 
varied to obtain different infonnation about the compounds 
being tested. For example, test compounds that Interfere 
with or potentiate the interaction between the HKNGl gene 
25 products and the binding partners, e.g., by competition, can 
be identified by conducting the reaction in the presence of 
the test substance; i.e., by adding the test substance to the 
reaction mixture prior to or simultaneously with the HKNGl 
gene product and interactive intracellular binding partner. 
Alternatively, test compounds that disrupt preformed 
complexes, e.g., compounds with higher binding constants that 
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10 



displace one of the components from the complex, can be 
tested by adding the test compound to the reaction mixture 
after complexes have been formed. The various formats are 
described briefly below. 

In a heterogeneous assay system, either the HENGl gene 
product or the interactive binding partner, is anchored onto 
a solid surface, while the non-anchored species is labeled 
either directly or indirectly. In practice, microtiter 
plates are conveniently utilized. The anchored species may 
be immobilized by non-covalent or covalent attachments. Non- 
covalent attachment may be accomplished simply by coating the 
solid surface with a solution of the HKNGl gene product or 
binding partner and drying. Alternatively, an immobilized 
antibody specific for the species to be anchored may be used 
to anchor the species to the solid surface. The surf aces may 
be prepared in advance and stored. 

in order to conduct the assay, the partner of the 
immobilized species is exposed to the coated surface with or 
without the test compound. After the reaction is complete, 
unreacted components are removed (e.g., by washing) and any 
complexes formed will remain immobilized on the solid 
surface. The detection of complexes anchored on the solid 
20 surface can be accomplished in a number of ways, where the 
non-immobilized species is pre-labeled, the detection of 
label immobilized on the surface indicates that complexes 
were formed. Where the non-immobilized species is not pre- 
labeled, an indirect label can be used to detect complexes 
anchored on the surface? e.g., using a labeled antibody 
25 specific for the initially non-immobilized species (the 
antibody, in turn, may be directly labeled or indirectly 
labeled with a labeled anti-lg antibody) . Depending upon the 
order of addition of reaction components, test compounds that 
inhibit complex formation or that disrupt preformed complexes 
_ can be detected. 

30 Alternatively, the reaction can be conducted in a liquid 

phase in the presence or absence of the test compound, the 
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reaction /products separated from unreacted components, and 
complexes detected; .g,, using an immobilized antibody 
specific for one of the binding components to anchor any 
complexes formed in solution, and a labeled antibody specific 
^ for the other partner to detect anchored complexes. Again, 
depending upon the order of addition of reactants to the 
liquid phase, test compounds that inhibit complex formation 
or that disrupt preformed complexes can be identified. 

In an alternate embodiment of the invention, a 
homogeneous assay can be used, in this approach, a preformed 
complex of the HKNGl gene product and the interactive binding 
partner is prepared in which either the HKITGl gene product or 
its binding partners is labeled, but the signal generated by 
the label is quenched due to complex formation (see, e.g., 
U.S. Patent No. 4,109,496 by Rubenstein which utilizes this 
approach for immunoassays) . The addition of a test substance 
15 that competes with and displaces one of the species from the 
preformed complex will result in the generation of a signal 
above background. In this way, test substances that disrupt 
SKNGl gene product/binding partner interaction can be 
identified. 

In another embodiment of the invention, these same 
20 techniques can be employed using peptide fragments that 

correspond to the binding domains of the HKNGl product and/or 
the binding partaer (in cases where the binding partner is a 
protein) , in place of one or both of the full length 
proteins. Any number of methods routinely practiced in the 
art can be used to identify and isolate the binding sites. 
These methods include, but are not limited to, mutagenesis of 
the gene encoding one of the proteins and screening for 
disruption of binding in a co-immunoprecipitation assay. 
Compensating mutations in the gene encoding the second 
^ species in the complex can then be selected. Sequence 
analysis of the genes encoding the respective proteins will 
reveal the mutations that correspond to the region of the 
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protein involved in interactive binding. Alternatively, one 
protein can be anchored to a solid surface using methods 
described in this Section above, and allowed to interact with 
and bind to its labeled binding partner, which has been 
treated with a proteolytic enzyme, such as trypsin. After 
5 washing, a short, labeled peptide comprising the binding 
domain may remain associated with the solid material, which 
can be isolated and identified by amino acid sequencing. 
Also, once the gene coding for the segments is engineered to 
express peptide fragments of the protein, it can then be 
tested for binding activity and purified or synthesized. 

For example, and not by way of limitation, a HKNGl gene 
product can be anchored to a solid material as described, 
above, in this Section by making a GST-HKNGl fusion protein 
and allowing it to bind to glutathione agarose beads. The 
binding partner can be labeled with a radioactive isotope, 
such as "S, and cleaved with a proteolytic enzyme such as 
trypsin. Cleavage products can then be added to the anchored 
GST'HKNGl fusion protein and allowed to bind. After washing 
away unbound peptides, labeled bound material, representing 
the binding partner binding domain, can be eluted, purified, 
and analyzed for amino acid sequence by well-known methods.' 
20 Peptides so identified can be produced synthetically or 
produced using recombinant DNA technology. 

5.8.4. ASSAYS FOR IDBNTZFXGATZON OP COMPOUKDS 

THAT AMBLIORATE A RKMSl-MEDTAOJED DISORDER 

Compounds, including but not limited to binding 
25 compounds identified via assay techniques such as those 
described, above, in Sections 5.8.1 - 5.8.4, can be tested 
for the ability to ameliorate symptoms of a HKWGI -mediated 
disorder, e^, a CNS-related disorder, such as a 
neuropsychiatric disorder, including schizophrenia and 
-bipolar af f ctive (mood) disorders, including severe bipolar 
30 affective (mood) disorder (BP-I) , bipolar affective (mood) 
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disorder with hypomania and major depression (BP-II) , and 
myopia disorders 

It should be noted that the assays described herein can 
identify compounds that affect SKNGl activity by either 
^ affecting HKNGl gene expression or by affecting the level of 
HKNGl gene product activity. For example, compounds may be 
identified that are involved in another step in the pathway 
in which the HKNGl gene and/or HKNGl gene product is involved 
and, by affecting this same pathway may modulate the effect 
of HKNGl on the development of a JIIWGI -mediated disorder. 
Such compounds can be used, e.g. , as part of a therapeutic 
method for the treatment of the disorder. 

Described below are cell-based and animal model-based 
assays for the identification of compounds exhibiting such an 
ability to ameliorate symptoms of a HiWGI -mediated disorder, 
e^cLs., neuropsychiatric disorder, such as BAD or 
schizophrenia. 

First, cell-based systems can be used to identify 
compounds that may act to ameliorate symptoms of a HKNGl- 
mediated disorder. Such cell systems can include, for 
example, recombinant or non-recombinant cell, such as cell 
20 express the HKNGl gene. 

In utilizing such cell systems, cells that express HKNGl 
may be exposed to a compound suspected of exhibiting an 
ability to ameliorate symptoms of sl HKNGl'-mBdxeit&d disorder, 
Bj^, a neuropsychiatric disorder, such as BAD or 
schizophrenia, at a sufficient concentration and for a 
25 sufficient time to elicit such an amelioration of such 

symptoms in the exposed cells. After exposure, the cells can 
be assayed to rieasure alterations in the expression of the 
HKNGl gene, e.g., by assaying cell lysates for HKNGl mSNA 
transcripts (e.g., by Northern analysis) or for HKNGl gene 
30 P^o^^^s expressed by the cell; compounds that modulate 
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expression of the HKNGl gene are good candidates as 
therapeutics. 

In addition, animal-based systems or models for a HKNGl- 
mediated disorder, e^, neuropsychiatric disorder, for 
5 example, transgenic mice containing a human or altered fr.^ 
Of EKNGl gene, may be used to identify compounds capable of 
ameliorating symptoms of the disorder, such animal models 
may be used as test substrates for the identification of 
drugs, pharmaceuticals, therapies and interventions. For 
example, animal models may be exposed to a compound suspected 
10 of exhibiting an ability to ameliorate symptoms, at a 

sufficient concentration and for a sufficient time to elicit 
such an amelioration of symptoms of a HJOTGl-mediated 
disorder. The response of the animals to the exposure may be 
monitored by assessing the reversal of the symptoms of the 
disorder. 

15 With regard to intervention, any treatments that reverse 

any aspect of symptoms of a FJCffGl -mediated disorder, should 
be considered as candidates for human therapeutic 
Intervention in such disorders. Dosages of test agents may 
be determined by deriving dose-response curves, as discussed 
in Section 5.10.1, below. 



20 



5.9. COMPOUBDS ASD METHODS FOR THE TRBATMBNT OP 
HKNGl -m^IATED DI^BORDBafl 

Described below are methods and compositions whereby a 
fflOTGl-mediated disorder described herein, e^, a SKNGl- 
mediated neuropsychiatric disorder, such as BAD or 
schizophrenia, may be treated. For example, such methods can 
comprise administering compounds which modulate the 
expression of a mammalian HKNGl gene and/or the synthesis or 
activity of a mammalian HKNGl gene product (e.g., a 
.recombinant HKNGl gene product) so symptoms of the disorder 
30 are ameliorated. 
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Alternatively, in those instances whereby the HKNGl- 
mediated disorders result from HKNGl gene mutations, such 
methods can comprise supplying the subject with a nucleic 
acid molecule encoding an unimpaired HKNGl gene product such 
5 that an unimpaired HKNGl gene product is expressed and 
symptoms of the disorder are ameUorated. 

in another embodiment of methods for the treatment of 
fflWGl-mediated disorders resulting from HKNGl gene mutations, 
such methods can comprise supplying the subject with a cell 
comprising a nucleic acid molecule that encodes an unimpaired 
EKNGl gene product such that the cell expresses the 
unimpaired HKNGl gene product and symptoms of the disorder 
are ameliorated. 

in cases in which a loss of normal HKNGl gene product 
function results in the development of a flKWGl-mediated 
15 disorder an increase in HKNGl gene product activity would 
facilitate progress towards an asymptomatic state in 
individuals exhibiting a deficient level of HKNGl gene 
expression and/or HKNGl gene product activity. Methods for 
enhancing the expression or synthesis of HKNGl can include, 
20 for example, methods such as those described below, in 
Section 5.9.2. 

Alternatively, symptoms of HKWGl-mediated 
neuropsychiatric disorders, may be ameliorated by 
administering a compound that decreases the level of HKNGl 
gene expression and/or HKNGl gene product activity. Methods 
for inhibiting or reducing the level of HKNGl gene product 
synthesis or expression can include, for example, methods 
such as those described in Section 5,9.1. 

In one embodiment of treatment methods, the compounds 
_ administered comprise compounds, in particular drugs, which 
3P ameliorate the symptoms of a disorder described herein as a 
neuropsychiatric dis rder, such as BAD or schizophrenia. 
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such compounds include drugs within the families of 
antidepressants such as lithium salts, carbamazepine, 
valproic acid, lysergic acid diethylamide (LSD), p- 
chlorophenylalanine, p-propyldopacetamide dithiocarbamate 
5 derivatives e.g., FLA 63; anti-anxiety drugs, e.g., diazepam; 
monoamine oxidase (MAO) inhibitors, e.g., iproniazid, 
clorgyline, phenelzine and isocarboxazid; biogenic amine 
uptake blockers, e.g., tricyclic antidepressants such as 
desipramine, imipramine and amitriptyline; serotonin reuptake 
inhibitors e.g., fluoxetine; antipsychotic drugs such as 
phenothiazine derivatives (e.g., chlorpromazine (thorazine) 
and trifluopromazine)), butyrophenones (e.g., haloperidol 
(Haldol)), thioxanthene derivatives (e.g., chlorprothixene) , 
and dibenzodiazepines (e.g., clozapine); benzodiazepines; 
dopaminergic agonists and antagonists e.g., L-DOPA, cocaine, 
amphetamine, a-methyl-tyrosine , reserpine, tetrabenazine, 
benzotropine, pargyline; noradrenergic agonists and 
antagonists e.g., clonidine, phenoxybenzamine, phentolamine, 
tropolone. 

In another embodiment, symptoms of a disorder described 
2^ herein, e.g., a HKffGl-mediated neuropsychiatric disorder such 
as BAD or schizophrenia, may be ameliorated by BKNGl protein 
therapy methods, e^, decreasing or increasing the level 
and/ or of HKNGl-activity using the HKNGl protein, fusion 
protein, and peptide sequences described in Section 5.2, 
above, or by the administration of proteins or protein ' 
25 fragments (e.g., peptides) which interact with an HKNGl gene 

or gene product and thereby inhibit or potentiate its 
activity. 

such protein therapy may include, for exaa^le, the 
administration of a functional HKNGl protein or fragments of 
^- an HKNGl protein ( .g., peptides) which represent functional 
BKNGl domains. 



- 82 - 



wo 99/47535 



PCT/US99/05606 



in one embodiment, HKNGl fragments or peptides 
representing a functional HKNGl binding domain are 
administered to an individual such that the peptides bind to 
an HKNGl binding protein, e.gr., an HKNGl receptor. Such 
^ fragments or peptides may serve to inhibit HKNGl activity in 
an individual by competing with, and thereby inhibiting 
binding of HKNGl to the binding protein, thereby ameliorating 
symptoms of a disorder described herein. Alternatively, such 
fragments or peptides may enhance HKNGl activity in an 
individual by mimicking the function of HKNGl in vivo, 

thereby ameliorating the symptoms of a disorder described 
herein. 

The proteins and peptides which may be used in the 
methods Of the invention include synthetic (e.g., recombinant 
or Chemically synthesized) proteins and peptides, as well as 
naturally occurring proteins and peptides. The proteins and 
IS peptides may have both naturally occurring and non-naturally 
occurxng amino acid residues (e.g., D-amino acid residues) 
and/or one or more non-peptide bonds (e.g., i:ni„o , ester, 
hydrazide, semicarbazide, and azo bonds). The proteins or 
peptides may also contain additional chemical groups (i.e., 
functional groups) present at the amino and/or carboxy 
termini, such that, for example, the stability 
Moavailability, and/or inhibitory activity of the peptide is 
enhanced. Exemplary functional groups include hydrophobic 
groups (e.g. carbobenzoxyl, dansyl, and t-butyloxycarbonyl, ' ' 
groups) , an acetyl group, a 9-fluorenylmethoxy-carbonyl 
25 group, and macromolecular carrier groups (e.g., lipid-fatty 
acid conjugates, polyethylene glycol, or carbohydrates) 
including peptide groups. 

5.9.1. IHHIBITORY ASTIBENBB, RIBOZYME 
AMD TRIPI ,E WELTTt A PPRQiVCTngl j 

30 In another embodiment, symptoms of BJOTGI -mediated 

neuropsychiatric disorders may be ameliorated by decreasing 
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the level of HKNGl gene expression and/or HKNGl gene product 
activity by using HKNGl gene sequences in conjunction with 
well-known antisense, gene "knock-out," ribozyme and/or 
triple helix methods to decrease the level of SKNGl gene 
5 expression. Among the compounds that may exhibit the ability 
to modulate the activity, expression or synthesis of the 
HKNGl gene, including the ability to ameliorate the symptoms 
of a HiT^Gl -mediated neuropsychiatric disorder, such as BAD or 
schizophrenia, are antisense, ribozyme, and triple helix 
molecules* Such molecules may be designed to reduce or 
inhibit either unimpaired, or if appropriate, mutant target 
gene activity. Techniques for the production and use of such 
molecules are well known to those of skill in the art. 

Antisense RNA and DNA molecules act to directly block 
the translation of mRNA by hybridizing to targeted mRNA and 
preventing protein translation. 2^tisense approaches involve 
the design of oligonucleotides that are complementary to a 
target gene mRNA. The antisense oligonucleotides will bind 
to the complementary target gene mRNA transcripts and prevent 
translation. Absolute complementcirily, although preferred, 
is not required. 

A sequence "complementary" to a portion of an RNA, as 

2® referred to herein, means a sequence having sufficient 

complementarily to be able to hybridize with the RNA, forming 
a stable duplex; in the case of double-stranded antisense 
nucleic acids, a single strand of the duplex DNA may thus be 
tested, or triplex formation may be assayed. The ability to 
hybridize will depend on both the degree of complementarily 

25 and the length of the antisense nucleic acid. Generally, the 
longer the hybridizing nucleic acid, the more base mismatches 
with an RNA it may contain and still form a stable duplex (or 
triplex, as the case may be) . One skilled in the art can 
ascertain a tolerable degree of mismatch by use of stetndard 
- procedures to determine the melting point of the hybridized 

30 complex. 
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in one embodiment, oligonucleotides complementary to 
Bon-codmg regions of the Hmci gene could be used .n an 
antisense approach to irt^ibit translation of endogenous HK^Gl 
MiKA. Antisense nucleic acids should be at least six 
, nucleotides m length, and are preferably oligonucleotides 
rangang from 6 to about 50 nucleotides in length, m 
specific aspects the oligonucleotide is at least 10 
nucleotides, at least 17 nucleotides, at least 25 nucleotides 
or at least 50 nucleotides. nuoaeotiaes 

Regardless of the choice of target sequence, it is 
„ preferred that In vitro studies are first performed to 
Wantitate the ability of the antisense oligonucleotide to 
xnhxbit gene «,ression. It is preferred that these studies 
u^lize controls that distinguish between antisense gene 
Inhibition and nonspecific biological effects of 
Oligonucleotides, it is also preferred that these studies 

" °' ™* °' P^"'^" "i"- Of an 

internal control RMA or protein. Additionally, it is 

envisioned that results obtained using the antisense 
Oligonucleotide are compared with those obtained using a 
control Oligonucleotide. It is preferred that the control 
Oligonucleotide is of approximately the same length as the 
20 test Oligonucleotide and that the nucleotide sequence of the 
oligonucleotide differs from the antisense sequence no more 
than IS necessary to prevent specific hybridization to the 
target sequence. 

The oligonucleotides can be DNA or RNA or chimeric 
fixtures or derivatives or modified versions thereof, single- 
25 stranded or double-stranded, oligonucleotide can be 

^odlfxed at the base moiety, sugar moiety, or phosphate 
backbone, for example, to improve stability of the molecule 
hybridization, etc. The oligonucleotide may include oZl 
appended groups such as peptides (e.g., for targeting host 
cell receptors in vivo) . or agents facilitating transport 
across the cell membrane (see, e.g., Letsinger, et ai., 1939 
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Proc. Natl. Acad. Sci. U.S.A. 86:6553-6556; Lemaitre, et al,, 
1987, Proc. Natl. Acad. Sci. U.S.A. 84:648-652; PCT 
Publication No. W088/09810, published December 15, 1988) or 
the blood-brain barrier (see, e.g., PCT Publication Mo. 
g W089/10134, published April 25, 1988), hybridization- 
triggered cleavage agents (see, e.g., Krol et al., 1988, 
BioTeohniques 6:958-976) or intercalating agents (see, e.g., 
Zon, 1988, Pharm. Res. 5:539-549). To this end, the 
oligonucleotide may be conjugated to another molecule, e.g., 
a peptide, hybridization triggered cross-linking agent, 
^0 transport agent, hybridization-triggered cleavage agent, etc. 

The antisense oligonucleotide may comprise at least one 
modified base moiety which is selected from the group 
including but not limited to 5-f luorouracil, 5-bromouracil, 
5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 

4- acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 

5- carboxymethylamindmethyl-2-thiouridine, 
5-carboxymethylaminomethyluracil, dihydrouracil, beta-D- 
galactosylqueosine, inosine, N6-isopentenyladenine, 

1- methylguanine, 1-methyl inosine, 2,2-dimethylgueuiine, 

2- methyladenine, 2-methylgucmine, 3-methylcytosine, 
5-methylcytosine, N6-adenine, 7-methylguanine, 

2 0 5-methylamlnomethylxaracil , 5-methoxyaminomethyl-2-thiouracil , 
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 
5-methoxyuracil, 2-methylthio-N6-isopentenyladenine, 
uracil-5-03^acetic acid (v) , wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 
2-thiouracil, 4-thiouracil , 5-methyluracil, uracil- 
5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v) , 
5-methyl-2-thiouracil, 3- (3-amino-3-H-2-carboxypropyl) 
uracil, (acp3)w, and 2,6-diaminopurine. 

The antisense oligonucleotide may also comprise at least 
one modified sugar moiety selected from the group including 

- but not limited to arabinose, 2-f luoroarabinose, xylulose, 

3® and hexose. 
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In yet another embodiment, the antisense oligonucleotide 
comprises at least one modified phosphate backbone selected 
from the group consisting of a phosphorothioate, a 
phosphorodithioate, a phosphoramidothioate, a 
phosphoramidate, a phcsphordiamidate, a methylphosphonate, an 
* alkyl phosphotriester, and a formacetal or analog thereof. 

In yet another embodiment, the antisense oligonucleotide 
is an a-anomeric oligonucleotide. An a-anomeric 
oligonucleotide forms specific double-stranded hybrids with 
complementary RNA in which, contrary to the usual p-units, 
the strands run parallel to each other (Gautier, et al.. 

10 

1987, Nucl. Acids Res. 15:6625-6641). The oligonucleotide is 
a 2'-0-methylribonucleotide (Inoue, et al., 1987, Nucl. Acids 
Res. 15:6131-6148), or a chimeric RNA-DNA analogue (Inoue, et 
al., 1987, FEES Lett. 215:327-330). 

Oligonucleotides of the invention may be synthesized by 
15 standard methods known in the art, e.g., by use of an 

automated DNA synthesizer (such as are commercially available 
from Biosearch, Applied Biosystems, etc.). As examples, 
phosphorothioate oligonucleotides may be synthesized by the 
method of Stein, et al. (1988, Nucl. Acids Res. 16:3209), 
methylphosphonate oligonucleotides can be prepared by use of 
controlled pore glass polymer supports (Sarin, et aJ., 1988, 
Proc. Natl. Acad. Sci. U.S.A. 85:7448-7451) , etc. 

While antisense nucleotides complementary to the target 
gene coding region sequence could be used, those 
complementary to the tremscribed, untranslated region are 
most preferred. 

Antisense molecules should be delivered to cells that 
egress the target gene in vivo. A number of methods have 
been developed for delivering antisense DNA or RNA to cells; 
e.g., antisense molecules can be injected directly into the 
- tissue site, or modified antisense molecules, designed to 
30 target the desired cells (e.g., antisense linked to peptides 
or antibodies that specifically bind receptors or antigens 
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expressed on the target cell surface) can be administered 
systemically. 

A preferred approach to achieve intracellular 
concentrations of the antisense sufficient to suppress 
translation of endogenous mRNAs utilizes a recombinant DNA 
construct in which the antisense oligonucleotide is placed 
under the control of a strong pol iii or pol ii promoter. 
The use of such a construct to transfect target cells in the 
patxent will result in the transcription of sufficient 
amounts of single stranded RNAs that will form complementary 
base pairs with the endogenous target gene transcripts and 
thereby prevent translation of the target gene mRNA. For 
example, a vector can be introduced e.g., such that it is 
taken up by a cell and directs the transcription of an 
antisense SNA. Such a vector can remain episomal or become 
chromosomally integrated, as long as it can be transcribed to 
produce the desired antisense RNA. Such vectors can be 
constructed by recombinant DNA technology methods standard in 
the art. Vectors can be plasmid, viral, or others known in 
the art, used for replication and expression in mammalian 
cells. Expression of the sequence encoding the antisense RNA 
can be by any promoter known in the art to act in mammalian 
preferably human cells. Such promoters can be inducible or' 
constitutive, such promoters include but are not limited to- 
the SV40 early promoter region (Bernoist and Chambon, 1981 
Nature 290:304-310), the promoter contained in the 3' long ' 
terminal repeat of Rous sarcoma virus (Yamamoto, et al., 
1980, cell. 22:787-797), the herpes thymidine kinase promoter 
25 (Wagner, et al., i98l, Proc. Natl. Acad. Sci. U.S.A. 78:i44i- 
1445), the regulatory sequences of the metallothionein gene 
(Brinster, et.al., 1982, Nature 296:39-42), etc. Any type of 
plasmid, cosmid, yac or viral vector can be used to prepare 
the recombinant DNA construct which can be introduced 
- directly into the tissue site. Alternatively, viral vectors 
30 can be used that selectively infect the desired tissue, in 
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Which case administration may be accomplished by another 
route (e.g., systemically) . 

Ribozyme molecules designed to catalytically cleave 
target gene mENA transcripts can also be used to prevent 
^ translation of target gene mRNA and, therefore, expression of 
'target gene product. (See, e.g., pct International 

Publication WO90/11364, published October 4, 1990; Sarver, et 
al., 1990, Science 247, 1222-1225). 

Ribozymes are enzymatic RKA molecules capable of 
catalyzing the specific cleavage of RWA. (For a review, see 
10 Rossi, 1994, current Biology 4:469-471). The mechanism of 
ribozyme action involves sequence specific hybridization of 
the ribozyme molecule to complementary target RNA, followed 
by an endonucleolytic cleavage event. The composition of 
ribozyme molecules must include one or more sequences 
complementary to the target gene mRNA, and must include the 
15 well known catalytic sequence responsible for mRNA cleavage. 
Por this sequence, see, e.g., U.S. Patent No. 5,093,246, 
which is incorporated herein by reference in its entirety. 

While ribozymes that cleave mRNA at site specific 
recognition sequences can be used to destroy target gene 
iDRNAs, the use of hammerhead ribozymes is preferred. 
20 Hammerhead ribozymes cleave mRNAs at locations dictated by 
flanking regions that form complementary base pairs with the 
target mRNA. The sole requirement is that the target mRNA 
have the following sequence of two bases: 5»-UG-3'. The 
construction and production of hammerhead ribozymes is well 
known in the art and is described more fully in Myers, 1995, 
25 Molecular Biology and Biotechnology: A Comprehensive Desk 
Reference, VCH Publishers, New York, (see especially Figure 
4, page 833) and in Haseloff and Gerlach, 1988, Nature, 
334:585-591, which is incorporated herein by reference in its 
entirety. 

30 Preferably the ribozyme is engineered so that the 

cleavage recognition site is located near the 5» end of the 
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t^get gene i.e., to increase efficiency a™. »lni»lze 

the intracellular acc^ulaticn of non-functional 
transcripts. 

ae rlbo^smes of the present invention also include BHA 
^ endoribonucleases (hereinafter "cech-type ribozvMes», sur^s 
the one that occurs naturally in ^etrahy^ena theLpLia 
(too™ as the ivs, or L-19 ivs r„a, and that has been 
extensively described by Tho^s Cech and collaborators (Zaug, 

ec al., 1984, Scxence, 224:574-578 • 7an« a,,^ ^ w 

' **.3/t o/B, zaug euid Cech, 1986, 

science, 231:470-475; Zaug, et ai., i<,s6. Nature, 324M2,- 

" 88/04C^T-""*^"°"^' """^^ application Ho. wo 
88/04300 by university Patents Inc.; Been and Cech, 1,86, 
cell, 47:207-216,. The Ceoh-type ribozy^es have a^ eigh; 
base pair active site which hybridizes to a target rha 
seguence whereafter cleavage of the target HKA takes place. 
IS ~ "1"'°°"'"""" ce^-ty^ ribozy»es which 

As in the antisense approach, the ribozymes can be 
coiupcsed Of modified oligonucleotides (e.g., for improved 
stability, targeting, etc., and should be delivered to cells 
that express the target gene in vivo. A preferred method of 
delivery involves using a DNA construct "encoding" the 
riiozyzae under the control of a strong constitutive pel Hi 

Lfl' "/""T' '° transfected cells will produce 

s^xcxent quantities of the ribozyme to destroy endogenous 
target gene messages and inhibit translation. Because 

« TZ^l r"^' -lecules, are catalytic, a lower 

intracellular concentration is required for efficiency. 

a^dogenous target gene expression can also be reduced by 
xnactxvatxng or "knocking out" the target gene or its 
promoter using targeted homologous recombination (e.g., see 
smithies, et al., 1985, Nature 317:230-234; Thomas and' 

30 Capecchi, 1987, cell 51:503-512; Thompson, et al., 1989, Cell 
5:313-321; each of which is incorporated by reference herein 
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m its entirety), por example, a mutant, non-functional 
target gene (or a completely unrelated DNA sequence) flanked 
by DNA homologous to the endogenous target gene (either the 
coding regions or regulatory regions of the target gene) can 
be used, with or without a selectable marker and/or a 
negative selectable marker, to transfect cells that express 
the target gene in vivo. Insertion of the DNA construct, via 
targeted homologous recombination, results in inactivation of 
the target gene, such approaches are particularly suited in 
the agricultural field where modifications to ES (embryonic 
stem) cells can be used to generate animal offspring with an 
inactive target gene (e.gr., see Thomas and Capecchi, 1987 and 
Thompson, 1989, supra). However this approach can be adapted 
for use in humans provided the recombinant DNA constructs are 
directly administered or targeted to the required site in 
vivo using appropriate viral vectors. 
*5 Alternatively, endogenous target gene expression can be 

reduced by targeting deoxyribonuoleotide sequences 
complementary to the regulatory region of the target gene 
(I.e., the target gene promoter and/or enhancers) to form 
triple helical structures that prevent transcription of the 
target gene in target cells in the body. (See generally, 
Helene, 1991, Anticancer Drug Des., 6(6) :569-584; Helene, et 
alw 1992, Ann. N.Y. Acad. Sci., 660:27-36; and Maher, 1992, 
Bioassays 14 (12) :807-8l5) . 

Nucleic acid molecules to be used in triplex helix 
formation for the inhibition of transcription should be 
25 single stranded and composed of deoxynucleotides . The base 
composition of these oligonucleotides must be designed to 
promote triple helix formation via Hoogsteen base pairing 
rules, which generally require sizeable stretches of either 
purines or pyrimidines to be present on one strand of a 
duplex. Nucleotide sequences may be pyrimidine-based, which 
30 wxll result in TAT and CGC* triplets across the three 
associated strands of the resulting triple helix. The 
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pyrimidine-rich molecules provide base complementarily to a 
purine-rich region of a single strand of the duplex in a 
parallel orientation to that strand, in addition, nucleic 
acid molecules may be chosen that are purine-rich, for 
example, contain a stretch of g residues. These molecules 
5 will form a triple helix with a DNA duplex that is rich in GC 
pairs, in which the majority of the purine residues are 
located on a single strand of the targeted duplex, resulting 
in GGC triplets across the three strands in the triplex. 

Alternatively, the potential sequences that can be 
targeted for triple helix formation may be increased by 
^0 creating a so called "switchback" nucleic acid molecule. 

Switchback molecules are synthesized in an alternating 5 '-a*, 
?'-5' manner, such that they base pair with first one strand' 
of a duplex and then the other, eliminating the necessity for 
a sizeable stretch of either purines or pyrimidines to be 
present on one strand of a duplex. 
15 In instances wherein the antisense, ribozyme, and/or 

triple helix molecules described herein are utilized to 
inhibit mutant gene expression, it is possible that the 
technique may so efficiently reduce or inhibit the 
transcription (triple helix) and/or translation (antisense, 
ribozyme) of mRNA produced by normal target gene alleles that 
the possibility may arise wherein the concentration of normal 
target gene product present may be lower than is necessary 
for a normal phenotype. in such cases, to ensxire that 
substantially normal levels of target gene activity are 
maintained, therefore, nucleic acid molecules that encode and 
express target gene polypeptides exhibiting normal target 
5 gene activity may, be introduced into cells via gene therapy 
methods such as those described, below, in Section 5.9.2 that 
do not contain sequences susceptible to whatever antisense, 
ribozyme, or triple helix treatments are being utilized. 
Alternatively, in instances whereby the target gene encodes 
an extracellular protein, it may be preferable to co- 
30 administer normal target gene protein in order to maintain 
the requisite level of target gene activity. 
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Anti-sense RNA and DNA/ ribozyme, and triple helix 
molecules of the invention may be prepared by any method 
known in the art for the synthesis of DNA and RNA molecules 
as discussed above. These include techniques for chemically 
synthesizing oligodeoxyribonucleotides and 
5 oligoribonucleotides well known in the art such as for 
example solid phase phosphoramidite chemical synthesis 
Alternatively, RNA molecules may be generated by in vitro and 
in vivo transcription of DNA sequences encoding the antisense 
RNA molecule. Such DNA sequences may be incorporated into a 
wide variety of vectors that incorporate suitable RNA 
polymerase promoters such as the T7 or SP6 polymerase 
promoters. Alternatively, antisense cDNA constructs that 
synthesize antisense RNA constitutively or inducibly, 
depending on the promoter used, can be introduced stibly into 
cell lines. 

15 

S»9,2. GENE REyrAGBMKWT THERAPY 

BKNGl gene nucleic acid sequences, described above in 
section 5.1, can be utilized for transferring recombinant 
EKKGl nucleic acid sequences to cells and expressing said 
sequences in recipient cells. Such techniques can be used, 
for example, in marking cells or for the treatment of a 
fflOTGl-mediated neuropsychiatric disorder. Such treatment can 
be in the form of gene replacement therapy. Specifically, 
one or more copies of a normal EKNGl gene or a portion of the 
EXNGl gene that directs the production of a EKNGl gene 

25 product exhibiting normal EKNGl gene function, may be 

inserted into the appropriate cells within a patient, using 
vectors that include, but are not limited to adenovirus, 
adeno-associated virus, and retrovirus vectors, in addition 
to other particles that introduce DNA into cells, such as 

. liposomes. 

30 B cause the HKNGl gene is expressed in the brain, such 

g ne replacement therapy techniques should be capable of 



wo 99/47535 



PCT/US99/OS606 



delivering HKNGl gene sequences to these cell types within 
patients. Thus, jLn one embodiment, techniques that are well 
known to those of skill in the art (see, e.g., PCT 
Publication No. WO89/10134, published April 25, 1988) can be 
g used to enable HKNGl gene sequences to cross the blood-brain 
barrier readily and to deliver the sequences to cells in the 
brain, with respect to delivery that is capable of crossing 
the blood-brain barrier, viral vectors such as, for example, 
these described above, are preferable. 

In another embodiment, techniques for delivery involve 
10 direct administration, e.g., by stereotactic delivery of such 
HKNGl gene sequences to the site of the cells in which the 
HKNGl gene sequences are to be expressed. 

Additional methods that may be utilized to increase the 
overall level of HKNGl gene expression and/or HKNGl gene 
^ product activity include using targeted homologous 

recombination methods, discussed in Section 5.2, above, to 
modify the expression characteristics of an endogenous HKNGl 
gene in a cell or microorganism by inserting a heterologous 
DNA regulatory element such that the dLnserted regulatory 
element is operatively linked with the endogenous HKNGl gene 
20 in question. Targeted homologous recombination can thus be 
used to activate transcription of an endogenous HKNGl gene 
that is "transcriptionally silent", i.e., is not normally 
e3q)ressed or is normally expressed at very low levels, or to 
enhance the expression of an endogenous HKNGl gene that is 
2^ normally e:q>ressed. 

Further, the overall level of HKNGl gene expression 
and/or HKNGl gene product activity may be increased by the 
introduction of appropriate HKNGl-expressing cells, 
preferably autologous cells, into a patient at positions and 
_ in numbers that are sufficient to ameliorate the syB5)toms of 



wo 99/47535 



PCT/US99/05606 



a HiTifGl-medlated neuropsychiatric disorder, such cells may 
be either recombinant or non-rcconbinant. 

Among the cells that can be administered to increase the 
overall level of HKNGl gene expression in a patient are 
5 normal cells, preferably brain cells, that express the HKNGl 
gene. Alternatively, cells,, preferably autologous cells, can 
be engxneered to express HKNGl gene sequences, and may then 
be introduced into a patient in positions appropriate for the 
amelioration of the symptoms of a JJCiTGl -mediated 
neuropsychiatric disorder. Alternately, cells that express 
an unimpaired HKNGl gene and that are from a MHC matched 
individual can be utilized, and may include, for example, 
brain cells. The expression of the EKNGl gene sequences is 
controlled by the appropriate gene regulatory sequences to 
allow such expression in the necessary cell types. Such gene 
„ regulatory sequences are well known to the skilled artisan, 
such cell-based gene therapy techniques are well known to 
those skilled in the art, see, e.^., Anderson, U.S. Patent 
No. 5,399,349. 

When the cells to be administered are non-autologous 
cells, they can be administered using well known techniques 
20 that prevent a host immune response against the introduced 
cells from developing. For example, the cells may be 
introduced in an encapsulated form which, while allowing for 
an exchange of components with the immediate extracellular 
environment, does not allow the introduced cells to be 
recognized by the host immune system. 
25 Additionally, compounds, such as those identified via 

techniques such as those described, above, in Section 5 8 
that are capable of modulating HKNGl gene product activity 
can be administered using standard techniques that are well 
known to those of skill in the art. m instances in which 
. the compounds to be administered are to involve an 
30 interaction with brain cells, the administration techniques 
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Should include well known ones that allow for a crossing of 
the blood-brain barrier. 



5.10. PHaRMACBOTICAL PREPARATIONS 

AND METH ODS OP APMIMISTOM'TOM 

5 The compounds that are determined to affect HKNGl gene 

expression or gene product activity can be administered to a 
patient at therapeutically effective doses to treat or 
ameliorate a IKOTGI -mediated disorder or modulate a HKNGl- 
related process described herein. A therapeutically 
effective dose refers to that amount of the compound 
sufficient to result in amelioration of symptoms of such a 
disorder. 

5.10.1. ECTECTIVE DOS^ 

Toxicity and therapeutic efficacy of such compounds can 
15 be determined by standard pharmaceutical procedures in cell 
cultures or experimental animals, e.g., for determining the 
LDso (the dose lethal to 50% of the population) and the EDjo 
(the dose therapeutically effective in 50% of the 
population) . The dose ratio between toxic and therapeutic 
effects is the therapeutic index and it can be expressed as 
20 the ratio LD50/ED50. Compounds that exhibit large therapeutic 
indices are prefeixed. While compounds that exhibit toxic 
side effects may be used, care should be taken to design a 
delivery system that targets such compounds to the site of 
affected tissue in order to minimize potential damage to 
uninfected cells and, thereby, reduce side effects. 
25 The data obtained from the cell culture assays and 

animal studies can be used in formulating a range of dosage 
for use in humans. The dosage of such compoTinds lies 
preferably within a range of circulating concentrations that 
include the ED30 with little or no toxicity. The dosage may 
vary within this range dep nding upon the dosage form 
30 employed and the route of administration utilized. For any 
compound used in the method of the invention, the 
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therapeutically effective dose can be estimated initially 
from cell culture assays. A dose may be formulated in animal 
models to achieve a circulating plasma concentration range 
that includes the IC^o (i.e., the concentration of the test 
^ compound that achieves a half-maximal inhibition of symptoms) 
as determined in cell culture. Such information can be used 
to more accurately determine useful doses in humans. Levels 
in plasma may be measured, for example, by high performance 
liquid chromatography. 

As defined herein, a therapeutically effective amount 
of antibody, protein, or polypeptide (i.e., an effective 
dosage) ranges from about o.OOl to 30 mg/kg body weight, 
preferably about 0.01 to 25 mg/kg body weight, more 
preferably about 0.1 to 20 mg/kg body weight, and even more 
preferably about i to 10 mg/kg, 2 to 9 mg/kg, 3 to 8 mg/kg, 4 
to 7 mg/kg, or 5 to 6 mg/kg body weight. 

The skilled artisan will appreciate that certain factors 
may influence the dosage required to effectively treat a 
subject, including but not limited to the severity of the 
disease or disorder, previous treatments, the general health 
and/or age of the subject, and other diseases present. 
Moreover, treatment of a subject with a therapeutically 
effective amount of a protein, polypeptide, or antibody can 
include a single treatment or, preferably, can include a 
series of treatments, in a preferred example, a subject is 
treated with antibody, protein, or polypeptide in the range 
of between about O.l to"20 mg/kg" body weight, one time per 
week for between about l to lo weeks, preferably between 2 to 
8 weeks, more preferably between about 3 to 7 weeks, and even 
more preferably for about 4, 5, or 6 weeks. It will also be 
appreciated that the effective dosage of antibody, protein, 
or polypeptide used for treatment may increase or decrease 
over the course of a particular treatment, changes in dosage 
may result and become apparent from the results of diagnostic 

- assays as described herein. 
30 
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S-10.2. P RHOIATIONB AMD TOB 

Pharmaceutical compositions for use in accordance with 
the present Invention may be formulated in conventional 
manner using one or more physiologically acceptable carriers 
or excipients. 

Thus, the compoiands and their physiologically acceptable 
salts and solvates may be formulated for administration by 
inhalation or insufflation (either through the mouth or the 
nose) or oral, buccal, parenteral rectal or topical 
administration . 

10 administration, the pharmaceutical compositions 

may take the form of, for example, tablets or capsules 
prepared by conventional means with pharmaceutical ly 
acceptable excipients such as binding agents (e.g., 
pregelatinised maize starch, polyvinylpyrrolidone or 
hydroxypropyl methylcellulose) ; fillers (e.g., lactose, 
15 microcrystalline cellulose or calcium hydrogen phosphate) ; 
lubricants (e.g., magnesium stearate, talc or silica); 
disintegrants (e.g., potato starch or sodium starch 
glycolate); or wetting agents (e.g., sodium lauryl sulphate). 
The tablets may be coated by methods well known in the art. 
20 preparations for oral administration may take the form 

of, for example, solutions, syrups or suspensions, or they 
may be presented as a dry product for constitution with water 
or other sultiUsle vehicle before use. Such liquid 
preparations miay be prepared by conventional means with 
pharmaceutlcally acceptable additives such as suspending 
agents (e.g., sorbitol syrup, cellulose derivatives or 
hydrogenated edible fats); emulsifying agents (e.g., lecithin 
or acacia^; non-aqueous vehicles (e.g., almond oil, oily 
esters, ethyl alcohol or fractionated vegetable oils); and 
preservatives (e.g., methyl or propyl-p-hydroxybenzoates or 
. sorbic acid) . The preparations may also contain buffer 
salts, flavoring, coloring and sweetening agents as 
appropriate. 
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Pr parations for oral administration may be suitably 
formulated to give controlled release of the active compound. 

For buccal administration the compositions may take the 
form of tablets or lozenges formulated in conventional 
manner. 

^ For administration by inhalation, the compounds for use 

according to the present invention are conveniently delivered 
in the form of an aerosol spray presentation from pressurized 
packs or a nebuliser, with the use of a suitable propellant, 
e.g., dichlorodif luoromethane , tr ichlorof luoromethane , 
dichlorotetrafluoroetheme, carbon dioxide or other suitable 

xo 

gas. In the case of a pressxirized aerosol the dosage unit 
may be determined by providing a valve to deliver a metered 
amount. Capsules and cartridges of e.g., gelatin for use in 
an inhaler or insufflator may be formulated containing a 
powder mix of the compoimd and a suitable powder base such as 
lactose or starch. 

The compounds may be formulated for peirenteral 
administration by injection, e.g., by bolus injection or 
continuous infusion. Formulations for injection may be 
presented in unit dosage form, e.g., in ampoules or in multi- 
dose containers, with an added preservative. The 

20 compositions may take such forms as suspensions, solutions or 
emulsions in oily or aqueous vehicles, and may contain 
formulatory agents such as suspending, stabilizing and/or 
dispersing agents. Alternatively, the active ingredient may 
be in powder form for constitution with a suitable vehicle, 
B.g., sterile pyrogen-free water, before use. 

^5 The compoionds may also be formulated in rectal 

con^jositions such as suppositories or retention enemas, e.g., 
containing conventional suppository bases such as cocoa 
butter or other glycerides. 

In certain embodiments, it may be desirable to 

- administer the pharmaceutical compositions of the invention 
3 0 

locally to the ar a in need of treatment. This may be 
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achieved by, for example, and not by way of limitation, local 
infusion during surgery, topical application, e.g., in 
conjunction with a wound dressing after surgery by 
injection, by means of a catheter, by means of a suppository 
^ or by means of an implant, said implant being of a porous ' 
non-porous, or gelatinous material, including membranes, ^uch 
as sialastic membranes, or fibers, m one embodiment 
administration can be by direct injection at the site' (or 
former site) of a malignant tumor or neoplastic or pre- 
neoplastic tissue. 

For topical application, the compounds may be combined 
with a carrier so that an effective dosage is delivered, 
based on the desired activity ' 

A topical formulation for treatment of some of the eye 
disorders discussed infra (e.g., myopia) consists of an 
effective amount of the compounds in a ophthalmologically 
^5 acceptable excipient such as buffered saline, mineral oil 
vegetable oils such as com or arachis oil, petroleum jelly 
Miglyol 182, alcohol solutions, or liposomes or liposome-ll]^e 
products. Any of these compositions may also include 
preservatives, antioxidants, antibiotics, immunosuppressants 
and other biologically or pharmaceutically effective agents ' 
20 which do not exert a detrimental effect on the compound. 

in addition to the formulations described previously 
the compounds may also be formulated as a depot preparation 
such long acting formulations may. be administered by 
implantation (for example subcutaneously or intramuscularly) 
or by intramuscular injection. Thus, for example, the 
25 compounds may be formulated with suitable polymeric or 
hydrophobic materials (for example as an emulsion in an 
acceptable oil) or ion exchange resins, or as sparingly 
soluble derivatives, for example, as a sparingly soluble 
salt. 

The compositions may, if desired, be presented in a pack 
30 or dispenser device that may contain one or more unit dosage 
forms containing the active ingredient. The pack may for 
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example comprise metal or plastic foil, such as a blister 
pack. The pack or dispenser device may be accompanied by 
instructions for administration. 

6. EZ&HPLE: TSE HKNGl GENE OF CHROHOSOME 18 IS 
5 MSOCIATED WITH THE KEURGTSYCSJUTRIC DISORDER 

BAD 

In the Example presented in this Section, studies are 
described that define a narrow interval of approximately 27 
kb on the short arm of human chromosome 18 which is 
associated with the neiaropsychiatric disorder BAD. The 
10 interval is demonstrated to lie within the gene referred to 
herein as the HKNGl gene. 

€•1- MATERIALS AMD METHODS 
C*l*l* LIMKASB DISBODTLTaRTITrf 

15 Linkage disequilibrium (LD) studies were performed using 

DNA from a population sample of neuropsychiatric disorder 
(BP-I) patients. The population sample and LD techniques 
were as described in Escamilla et aJ., 1996, Jim J. Med. 
Genet. ^7: 244-253. The present LD study took advantage of 
the additional population sample collection and the 

20 additional physical markers identified via the physical 
mapping techniques described below. 

6.1.2. YBAST ARTI FIGIAZ. IGHROMOSQMB (YACI MAPPIHQ 

Por physical mapping, yeast artificial chromosomes 
(YACs) containing humem sequences were mapped to the region 
being analyzed based on publicly available maps (Cohen et 
al., 1993, C.R. Acad. Sci. 316:1484-1488). The YACs were 
then ordered and contig reconstructed by performing stande^-d 
short tag sequence (STS) -content mapping with microsatellite 
markers and non-polymorphic STSs available from databases 
30 surround the genetically defined candidate region. 
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BACTERIAL ARTIPIOTAT. nTT RPMOSOMB #PAC) MRPPTWO 
STSs from the short arm of human chromosome 18 were 
used to screen a human BAG library (Research Genetics, 
HuntsvlUe, AL) . The ends of the BACs were cloned or 
^ directly sequenced. The end sequences were used to amplify 
the next overlapping BACs. Prom each BAG, additional 
microsatellites were identified. Specifically, random 
sheared libraries were prepared from overlapping BAGs within 
the defined genetic interval. BAG DKTA was sheared with a 
nebulizer (CIS-OS Inc., Bedford, MA). Fragments in the size 
range of 600 to 1,000 bp were utilized for the sublibrary 
production. Microsatellite sequences from the sublibraries 
were identified by corresponding microsatellite probes. 
Sequences around such repeats were obtained to enable 
development of PGR primers for genomic DNA. 

IS 6.1.4. RADIATIOM HYBRTn ( I ffll MAPPTWf! 

Standard RH mapping techniques were applied to a 
Stanford G3 RH mapping panel (Research Genetics, Huntsville,. 
AL) to order all microsatellite markers and non-polymorphic' 
STSs in the region being analyzed. 

20 6.1,5. SAMPLE BEOOENGING 

Random sheared libraries were made from all the BACs 
within the defined genetic region. Approximately 9,000 
subclones within the approximately 340 kb region containing 
the BAD interval were sequenced with vector primers in order 
to achieve an 8-fold sequence coverage of the region. All 

25 sequences were processed through an automated sequence 
analysis pipeline that assessed quality, removed vector 
sequences and masked repetitive sequences. The resulting 
sequences were then compared to public DNA and protein 
databases using BLAST algorithms (Altschul, et aJ., 1990, j. 

_ Molec. Biol., 215:403-410).' 
30 
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All se^enoes were contigad using Sequenoher 3.0 (Gene 
code ccrp , and phred and PHRAP (PhlU Green, Washington 
Dniversxty) into a single DNA fragment of 340 kb. 

5 6.2. RESDLTS 

,«,n>T'"° '"^"'"'"^ " "t"^" affective disorder 

(BAD) human genes had previously been reported to map to 

portions Of the long ,i8g, and short ,i8p, arms of hLn 

Chromosome 18 (Prelmer et al., Heuropsychiat . Genet. 

67:254-263; Freimer et aj., 1996, Nature Genetics 12:436-441- 
1. and Kclnnis et al., «roc. ^cad. Scie. a.s.^. ,3:130«-' 

13065). 

Htoh re.soim-ion 

in order to provide the precise order of genetic 
marlcers necessary for linkage and mapping, and to ,!lde 
new ..crosatellite marker development for finer mappl^, a 
15 high resolution physical map of the l8p candidate regll; L 
developed using yac, bac and rh technlgues. 

For such physical mapping, first, yacs were mapped to 
the chromosome X8 region being analyeed. osing the mapped 
»c oontxg as a frameworlc, the region from publicly avaUabl- 

n^Sfff ' f " -icrosatellite marker secp.ences were 

laentified and sequenced. 

r.«, r ^'^'^^^^"^^'^'^ « an accurate ^physical map, the 
radiation hybrid (8H, mapping technigue was independently 
applied to the region being analysed. SH was used to order 
25 all microsatellite markers and non-polymorphic srss in the 
region. Thus, the high resolution physical map ultimately 
co^tr«*ed was obtained using data from m mapping and sL- 
content mapping. 

Mnkag^ p^.P^„n^^r^,^ Prior to attempting to identify 
gene sequences, studies were performed to further narrow the 

•UseguiUbrlum (UJ) analysU was performed using p pulation 
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samples and techniques as described in Section 6.1, above 
which took advantage of the additional physical markers 
identified via the physical mapping techniques described 
below. 

Initial LD analysis narrowed the interval which 
5 associates with BAD disorders to a 340 kb region of I8p. BAC 
clones within this newly identified neuropsychiatric disorder 
region were analyzed to identify specific genes within the 
region. A combination of sample sequencing, cDNA selection 
and transcription mapping analyses were used to arrange 
sequences into tentative transcription units, that is 
" tentatively delineating the coding sequences of genes within 
this genomic region of interest. 

subsequent LD analyses further narrowed the BAD region 
of I8p to a narrow interval of approximately 27 kb. This was 
accomplished by identifying the maximum haplotype shared 
among affected individuals using additional markers. 
15 Statistical analysis of the entire 18p candidate region 

indicated that the 27 kb haplotype was significantly elevated 
in frequency among affected Costa Rican individuals 
(LOD = 2.2; p = 0.0005). 

This newly identified narrow interval was found to map 
^ completely within one of the transcription units identified 
as described above. The gene corresponding to this 
transcription unit is referred to herein as the HKNGl gene. 
Thus, the results of the mapping analyses presented in this 
section demonstrate that the itlOTGi gene of human chromosome 
18 is associated the neuropsychiatric disorder BAD. 
25 Analysis of the BAD interval indicated that the 27 kb 

BAD disease-associated chromosomal interval identified in the 
linkage disequilibrium studies is contained within an 
approximately 60 kb genomic region which contains a sequence 
referred to as GS4642 or rod photoreceptor protein (RPP) gene 
(Shimizu-Matsumoto, A. et al., 1997, invest. Ophthalmol, vis. 
Sci. 38:2576-2585). 
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7. EXAMPLE: SEQUENCE AM) CHARACTERIZAriON 

OF TH g HKNG1 nvnsv^ 

AS demonstrated in the Example presented in Section 6 
above, the HKNGl gene is involved in the neuropsychiatric ' 
^ disorder BAD. The results presented in this Section further 
characterxze the HKNGl gene and gene product, m particular 

a^Id'cnMr '""^ ^"^^^-^ °^ i= ' 

and CDNA sequences have revealed both the full length HKNGl 

amino acid sequence and the HKNGl genomic intron/exon 
structure, m particular, the nucleotide and predicted amino 
10 acid sequence of the HKNGl gene identified by these analyses 
disclose new HKNGl exon sequences, including new HKNGl 
protein coding sequence, discovered herein. Further, the 
expression of HKNGl in human tissue, especially neural 
tissue, is characterized by Northern and in situ 
„ hybridization analysis. The results presented herein are 
consistent with the HKNGl gene being a gene which mediates 
neuropsychiatric disorders such as BAD. 

MATERiar.S AND MBTwnna 
HKNGl CDNA none Tsmal-lon : Hybridization of a human 
brain and kidney cDNA library was performed according to 
standard techniques and identified a full-length HKNGl cDNA 
clone, in addition, a HKNGl cDNA derived from a splice 
variant was isolated, as described in Section 7.2, below. 
Northern Blot ftnaly ^lr; standard RNA isolation 
„ techniques and Northern blotting procedures were followed 
The HKNGl probe utilized corresponds to the complementary' 
sequence of base pairs 1367 to 1578 of the full length HKNGl 
CDNA sequence (SEQ id NO. 1) . clontech multiple tissue 
northern blots were probed, m particular, clontech human I, 

30 brain m blots were utilized for this study. 
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gitu Hvhrldi^;.t.i»r^ standard In situ 

hybridization techniques were utilized. The HKNGl probe 
utilized corresponds to the complementary sequence of base 
pairs 910 to 1422 Of the full length HKNGl cDNA sequence (SEQ 
5 ID NO. 1) . Brains for in situ hybridization analysis were 
Obtained from McLean Hospital (The Harvard Brain Tissue 
Resource Center, Belmont, MA 02178) . 

Other technirp ie^r: The remaining techniques described in 
section 7.2, below, were performed according to standard 
techniques or as discussed in Section 5.1, above. 

10 

7.2. RESDI.Ta 

ggWgl Nucleotide ii» ino Aeid 

A human brain cDNA library was screened and a full- 
length Clone Of HKNGl was isolated from this library, as 
described above. By comparing the isolated cDNA seqilence to 
sequences in the public databases, a clone was identified 
which had been previously identified as GS4642, or rod 
photoreceptor protein (KPP) gene (GenBank Accession No. 
D63813; Shimizu-Matsumoto, A. et al., 1997, invest. 
Ophthalmol. Vis. sci. 38:2576-2585). Although Shimizu- 
Matsumoto et al. refer to GS4642 as a full-length cDNA 
sequence, the isolated HKNGl cDNA extends approximately 200 
bp beyond the 5'end of the identified GS4642 clone. 

importantly, the HKNGl clone isolated herein reveals 
that, contrary to the amino acid sequence described in 
Shimizu-Matsumoto et al., the full length SKNGl amino acid 
25 sequence contains an additional 29 amino acid residues N- 
terminal to what had previously been identified as the full- 
length RPP (SEQ ID NO: 64). The full-length HKNGl nucleotide 
sequence (SEQ ID NO: i) and the derived amino acid sequence 
Of the full-length HKNGl polypeptide (SEQ ID NO: 2) encoded 
- by this sequence are depicted in FIG. lA-lB. 

The full-length HKNGl polypeptide was found to contain 
two clusterin similarity domains: clusterin similarity 
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domain 1 which corresponds to amino acid residues 134 to 
amino acid residue 160, and clusterin similarity domain 2 
which corresponds to amino acid residue 334 to amino acid 
residue 362. Such cluterin domains are typically 
characterized by five shared cysteine residues. In clusterin 
5 domain 1, these shared cysteine residues correspond to Cys 
134, Cysl45, Cysl48, Cysl58, and Cys 160. The shared 
cysteine residues in clusterin domain 2 correspond to the 
residues Cys334, Cys344, Cys351, Cys354, and Cys362. 

Full-length HKNGl cDNA sequence was compeired with the 
genomic contig completed by random sheaured library 
sequencing. Exon-intron boundeuries were identified manually 
by aligning the two sequences in Sequencher 3.0 and by 
deserving the conservative splicing sites where the 
alignments ended. This sequence comparison revealed that the 
additional cDNA sequence discovered through isolation of the 
full-length HKNGl cDNA clone actually belongs within three 
HKNGl exons. 

Prior to the isolation and analysis of HKNGl cDNA 
described herein, nine exons were predicted to be present 
within the corresponding genomic sequence. As discovered 
herein, however, the HKNGl gene, in contrast, actually 
contains 13 exons, with the new cDNA containing sequence 
which corresponds to a new exon 1, exon 2 and a 5* extension 
of what had previously been designated exon !• Splice 
vsoriants, discussed in Section 9 below, also exist which 
comprise additional exons 2* and 2". The genomic sec[uence 
and intron/exon structure of the HKNGl gene is shown in FIG. 
*® 3A-3R. 

The breakdown of exons was confirmed by the perfect 
alignment of the cDNA sequence with the genomic sequence and 
by observation of expected splicing sites flanking each of 
the additional, newly discovered exons. 

HKNGl nucleotide sequence was used to seeurch databases 

30 

of partial sequences of cDNA clones. This search identified 
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5 



a partial cDNA sequence derived from IMAGE clone R61493 
having similarity to the human HKNGl sequence. IMAGE clone 
R61493 was obtained and consists of a cDNA insert, the Lafmid 
BA vector backbone, and DMA originating from the oligo dT 
primer and Hind III adaptors used in cDNA library 
construction. The Lafmid BA vector nucleotide sequence is 
available at the DRL http://image.rzpd.de/lafmida_seq.html 
and descriptions of the oligo dT primer and Hind III adaptors 
are available in the GENBANK record corresponding to 
accession number R61493. 

The sequence of the cDNA insert revealed that the insert 
was derived from an alternatively spliced HKNGl mRNA variant, 
referred to herein as HKNGl-Vl. In particular, this HKNGl 
variant is deleted for exon 3 of the full length 13 exon 
HENGl sequence. The nucleotide sequence of this HKNGl 
variant (SEQ ID NO:3) is depicted in FIG. 2A-B. The amino 
*S acid sequence encoded by the HKNGl variant (SEQ ID NO: 3) is 
also shown in FIG. 2A-B. 

Preferably therefore, the nucleic acids of the invention 
include nucleic acid molecules comprising the nucleotide 
sequence of HKNGl-vi or encoding the polypeptide encoded by 
2^ EENGl-Vl in the absence of heterologous sequences (e.g., 
cloning vector sequences such as Lafmid BA; oligo dT primer, 
and Hind III adaptor) , 

'•2.2. HKUai GENE EmaBSSIOM 
EKNGl gene expression was examined by Northern blot 
2* analysis in various human tissues, a transcript of 

approximately 2 kb was detected in fetal brain, lung and 
kidney, and in adult brain, kidney, pancreas, prostate, 
testis, ovary, stomach, thyroid, spinal cord, lymph node and 
trachea. An approximately 1.5 kb transcript was also seen in 
- trachea, in addition, a larger transcript of approximately 5 
30 kb was detected in all adult neural regions tested (that is, 
cerebellum, cortex, medulla, spinal cord, occipital pole, 
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frontal lobe, temporal, putanen, amygdala, caudatte nucleus 
corpus callosum, hippocampus, whole brain, substantia nigra, 
subthalamic nucleus and thalamus) . Once again, this is in 
dxrect contrast to previous Northern analysis of the rpp 
^ gene, which reported that expression was limited to the 
retina (Shimizu-Matsumoto, A. et al., 1997, Invest. 
Ophthalmal. vis. Sci. 38:2576-2585). 

Analysis of HKNGl the tissue distribution was extended 
through an in situ hybridization analysis, m particular 
the HKNGl mRNA distribution in normal human brain tissue was 
10 analyzed. The results of this analysis are depicted in FIG 
4. AS summarized in PIG. 4, HKNGl is expressed throughout 
the brain, with transcripts being localized to neuronal and 
grey matter cell types. 

Finally, expression of HKNGl in recombinant cells 
demonstrates that the HKNGl gene encodes a secreted 
polypeptide (s) . 

8. A HIS8BNSE MOTATXON WITHIN HKNGl 
COHREIATES WTTH BAD 

The Example presented in Section 6, above, shows that 
the BAD disorder maps to an interval completely contained 
within the EKNGl gene of the short arm of human chromosome 
18. The Example presented in Section 7, above, characterizes 
the EKNGl gene and gene products. The results presented in 
this Example further these studies by identifying a mutation 
within the coding region of a HKNGl allele of an Individual 
25 exhibiting a BAD disorder. 

Thus, the results described herein demonstrate a 
positive correlation between a mutation which encodes a non- 
wxld-type EKNGl polypeptide and the appearance of the 
neuropsychiatric disorder BAD. The results presented herein, 
^. coupled with the r suits presented in Section 6, above. 
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identify HKNGl as a gene which mediates neuropsychiatric 
disorders such as BAD. 

®«^» MATERIAL S AND METTTnna 
5 Pairs of PCR primers that flank each exon (see TABLE 1 

above) were made and used to PCR amplify genomic DNA isolated 
from BAD affected and normal individuals. The amplified VCR 
products were analyzed using SSCP gel electrophoresis or by 
DNA sequencing. The DNA sequences and SSCP patterns of the 
affected and controls were compared and variations were 
further analyzed. 



8.2. RESDLTg 

in order to more definitively show that the HKNGl gene 
mediates neuropsychiatric disorders, in particular BAD, a 
study was conducted to explore whether a HKNGl mutation that 
^5 correlates with BAD could be identified. 

First, exon scanning was performed on all eleven exons 
of the HKNGl gene using chromosomes isolated from three 
affected and one normal individual from the Costa Rican 
population utilized for the LD studies discussed in Section 
6, above. No obvious mutations correlating with BAD were 
found through this analysis. 

Next, HKNGl intron and 3 '-untranslated regions within 
the 27 kb BAD interval were scanned by sscp and/or sequencing 
for all variants among three affected and one normal 
individual from the same population. Approximately 60 
25 variants were identified after scanning approximately two- 
thirds of the 27 kb genomic interval, which can be genotyped 
and analyzed by haplotype sharing and LD analyses, as 
described above, in order to identify ones which correlate 
with bipolar affective disorder. Fig. 5 lists selected 
variants identified through this study, 
j-jj Exon scanning using chromosomal . DNA from the general 

population of Costa Rica, however, successfully identified a 



- 110 - 



wo 99/47535 



PCTAJS99/0S6O6 



HKNGl aissense mutation in an individual affected with BAD 
who did not share the common diseased haplotype identified by 
the LD analysis provided above. In particular, exon scanning 
was done on exons l-ii of HKNGl nucleic acid from 129 
5 individuals from the general population affected with BAD. 

This analysis identified a point mutation in the coding 
region of exon 7 not seen in non-bipolar affected disorder 
individuals, specifically, the guanine corresponding to 
nucleotide residue 604 of SEQ ID NO:l (or nucleotide residue 
550 of SEQ ID N0:3) had mutated to an adenine. HKNGl protein 
10 expressed from this mutated HKNGl allele comprises the 

substitution of a lysine residue at amino acid residue 202 of 
SEQ ID NO: 2 (or amino acid residue 184 of SEQ ID NO: 4) in 
place of the wild-type glutamic acid residue. 

Additional HKNGl polymorphisms relative to the HKNGl 
wild-type sequence, and which, therefore, represent HKNGl 
alleles, were identified through sequence analysis of the 
HKNGl alleles within a collection of schizophrenic patients 
of mixed ethnicity from the United States and within a BAD 
collection from the San Francisco area. These variants are 
depicted in FIGS. 5A and 5B, respectively. Statistical 
20 analysis indicated that there were significantly more 

variants in the collection of schizophrenic patients of mixed 
ethnicity from the United States and the San Francisco BAD 
and Costa Rican BAD samples than in a collection of 242 
controls (p < 0.05) . 

25 ®* EXAMPLE: IDEHTIFZCanON OP AODITIOHAL 

HKNGl SPLICE ViVttTawPa 

This example describes the isolation and identification 
of three novel splice variants of the human gene HKNGl. 
First, a novel HKNGl clone was isolated from a human retinal 
CDNA library. This clone, which completely lacks exon 7 of 
30 the full length HKNGl cDNA sequence, is referred to herein as 
HKNG1A7. Because the deletion of exon 7 from the full length 
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HKNGl sequence leads to an immediate frameshift, the clone 
HKNG1A7 encodes a truncated form of the HKNGl protein. The 
HKNG1A7 cDNA sequence (SEQ ID NO: 65) is depicted in FIG. 18 
along with the predicted amino acid sequence (SEQ ID NO:66) 
5 of the HKNGia? gene product it encodes. 

TWO other novel splice. variants, referred to herein as 
HKNG1.V2 and HKNG1-V3, were isolated and identified by using 
RT-PCR analysis to isolate additional HKNGl sequences. The 
following primer sequences were used: 

5.-AGTTGCGTCCCTCrCTG™-3. (SEQ ID NO: 67) 

S.-GCrrCATGTOCCCGCTGTTA-S. (SEQ ID NO: 68) 

-These splice variants included additional exons between exons 
2 and 3 of the full length HKNGl sequence -fSEQ id NO:l). 
The RT-PCR product derived from HKNG1-V2 includes a 

^5 novel exon referred to as "exon 2' whereas the rt-pcr 

product derived from HKNG1-V3 includes a novel exon referred 
to as "exon 2" ... The sequence of these novel exons are 
provided in Table 2 below. The nucleotide sequence of the 
HKNG1-V2 RT-PCR product containing novel exon 2' is depicted 
in PIG. 6A (SEQ ID NO:36), whereas the HKNG1-V3 RT-PCR 

2^ product containing novel exon 2' is depicted in FIG. 6B (SEQ 
ID NO:37). Both exon 2' and 2" are part of the 5'- 
untranslated region of the HKNGl cDNA. 



XABLB 2 



25 



Exon 2" 5 . -TTCCCTCCCTTTGGAACGCAGCGTCGGCACC (SEQ ID NO: 34) 

T6CAACGCAGAGACCACTGTATCCCCGGTGCA6A 

ATGTAATGAGTGCCTGATACATTTGCCGAATAAA 

CTATTCCAAGG6TTGAACTTGCTGGAAGCAAGAG 
AAGCACTATTCTGG-3 • 



30 
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Exon 2" 5'-ATGGAGTCTTGCTCTCGTTGCCCAGACTGGA (SEQ DI NO: 35) 
GTGCACT6CTGCGATCTCAGCTCACTGCAACCTC 
TACCTCCCAGGTTCAAGCGATTCTCCTGCCTCAG 
CCTCTCGAGTGGCTGGGACTATAG-3 • 



EXAMPLE; IDEMTIPIC ATIOH OP H KNGl ORTWnT.ncB 

This example describes the isolation and 
characterization of genes in other mammalian species which 
are orthologs to human HKNGl, Specifically, both guinea pig 
and bovine HKNGl sequences eure described. 

10 

CUIKEA P IG mCNfil ORTOOI.QSS 

A guinea pig HKNGl ortholog, referred to as gpbknglBis, 
was isolated using RT-PCR. The cDNA sequence (SEQ ID NO: 38) 
and predicted amino acid sequence (SEQ ID NO: 39) are depicted 
in FIG. 7. Both the nucleotide and the predicted amino acid 
sequence of gphkng 1815 are similar to the human HKNGl 
nucleotide and amino acid sequences. Specifically, the 
program ALIGNV2.0 identified a 71.5% nucleotide sequence 
identity and a 62.8% amino acid sequence identity using 
standard parameters (Scoring Matrix: PAM120; GAP penalties- 
20 -"^-^>- 

Like the human HKNGl polypeptide, the predicted 
gphkng 1815 polypeptide also contains two clusterin 
similarity domains, which correspond to amino acid residues 
105 to 131 (clusterin domain l) , and amino acid residues 305- 
333 (clusterin domain 2) , respectively. Both of these 
domains contain the five conserved cysteine residues 
typically associated with clusterin domains. Specifically, 
these conserved cysteines correspond to Cysl05, Cysll6, 
Cysll9, Cysl24 and Cysl31 (clusterin similarity domain i) and 
Cys305, Cys315, Cys322, Cys325, and Cys333 (clusterin 
similarity domain l) of the gphkng 1815 polypeptide sequence. 
3^ Three allelic variants of gphkng 1815, referred to as 

gphkng 7b, gphkng 7c, and gphkng 7cl, respectively, w re also 
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identified by rt-pcr. Their nucleotide [SEQ ID NO: 40 
(gphkng 7b), SEQ ID NO: 42 (gpbkng 7c), and SEQ ID NO: 44 
{gphkng 7d) ] and amino acid [SEQ ID NO: 41 {gphkzig 7b), SEQ id 
no: 43 (gphkng 7c), and SEQ ID NO: 45 {gphkng 7d)] sequences 
5 are depicted in FIGS. 8 through lO, respectively. Each of 
these three allelic variants . contains a deletion within a 
region homologous to exon 7 of human HKNGl. The allelic 
variants retain the open reading frame of the gene, however, 
each allelic variant contains a deletion, relative to gphkr^g 

10 °^ residues, respectively. 

An alignment of the predicted amino acid sequences of 
gphkngiais, gphkng 7b, gphkng 7c, and gphkng7d is shown in 
PIG. 14. An alignment of the predicted amino acid sequences 
of the human HKNGl gene product, the guinea pig HKNGl 
ortholog gphkngl815, and the bovine HKNGl ortholog described 

J.5 in Subsection 10.2 below are shown in FIG. 16. 

3-0.2, BOVIHE HKWGl OTt'p ^nj.naR 

Bovine orthologs of HKNGl were also cloned by screening 
a cDNA library made from pooled bovine retinal tissue using a 
nucleotide sequence that corresponded to the complementary 
20 sequence of base pairs 910-1422 of the full length human 
HKNGl CDNA sequence (SEQ ID NO:l) as a probe. Three 
independent bovine cDNA species, referred to as bhkngl, 
bhkng2, and bhknga (SEQ ID NOS: 46 to 48, respectively) were 
isolated. Each of these allelic variants contains several 
single nucleotide polymorphisms (SNPs) . None of the SNPs 
25 results in an altered predicted amino acid sequence. Thus 
all three bovine cDNAs encodes the same predicted amino acid 
sequence (SEQ ID NO: 49). These SNPs apparently reflect the 
natural allelic variation of the pooled cDNA library from 
which the sequences were isolated. Each of the three bovine 
- HKNGl allelic variants is depict d in figs, ii to 13, 
30 respectively, along with the predicted amino acid seqilence 
which they encode. 
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The predxcted bovine HKNGl polypeptide also contains two 
clusterxn sx^ilarity domains, corresponding to a.ino acid 
residues 105-131 and amino acid residues 304-332 
respectively, of SEQ ID HO:49. clusterin do.ain'i contains 
the five Shared cysteine amino acid residues tyx,ically 
associated with this type of domain: cysios, cislie/cysiis 
Cysl24 and Cysi3i. clusterin domain 2 of the hovin; Skci 
polypeptide contains four conserved cystein residues- 
Cys314, Cys321, Cys324, and Cys332. 

gPREBSroW OF Bmniv mr^^, SEMF. pp^nrrrvn 

Described in this example is the construction of 
expression vectors and the successful expression of 
recombinant human HKNGl sequences. Expression vectors are 
described both for native HKNGl and for various hkngi fusion 
proteins. 

15 

A human HKNGl flag epitope-tagged protein (HKNGl: flag) 
vector was constructed by PCR followed by ligation into an 
vector for expression in HEK 293T cells. The full open- 
reading frame of the full length HKNGl cDNA sequence (SE« id 
„ K0:5) was PCR amplified using the following primer sequences: 



...^Mw^ Kv^^ucsiii^cia 

3' primer 5 • -ITTTTGTCCACaSiaTCACTTGTCGTCGTC tcvn TT^ 

GTCCTTGTAGTCcSStT5SSTCSS5 ^^^^ N«>-*53) 
TAAAAT6C-3 • 

the 5. primer incorporating a Kozak sequence upstream of and 
2S including the upstream initiator methionine and the 3.primer 
including the nucleotide sequence encoding the flag epitope 
DYKDDDDK (SEQ ID N0:50) followed by a termination codon 

The sequenced DNA construct was transiently transfected 
into EEK 293T cells in 150 mm plates using Lipof ectamine 
. (GIBCO/BRL) according to the manufacturer's protocol. 
30 Seventy-two hours post-transfection, the serum-free 

conditioned medium (OptiHEM, GIBCO/brl) was harvested and 



- 115 - 



wo 99/47535 PCTAJS99/05606 

spun and the remaining monolayer of cells was lysed using 
2 mL Of lysis buffer [50 m Tris pH 8.0, 150 m NaCl i% 
NP-40, 0.05% SDS with "Complete" protease cocktail 
(Boehringer Mannheim) diluted according to manufacturers 
instructions] . Insoluble material was pelleted before 
preparation of SDS-PAGE samples. 

Conditioned medium was electroblotted onto a PVDP 
membrane (Novex) after separation by SDS-PAGE on 4-20% 
gradient gels and probed with an M2 anti-flag polyclonal 
antibody (1:500, Sigma) followed by horseradish peroxidase 
(HHP) conjugated sheep anti-mouse antibody (irsooo, 
10 Amersham) , developed using chemiluminescent reagents 

(Renaissance, Dupont) , and exposed to autoradiography film 
XBiomax MR2 film, Kodak) . Flag immunoreactivity appeared as 
a doublet of bands that migrated by SDS-PAGE between 60 and 
95 kDa as determined by Multimark molecular weight markers 
(Novex), demonstrating secretion of the HKNGl:Flag protein 
The double band Indicates at least two different species with 
different mobilities on SDS-PAGE. Such doublets most 
commonly arise with posttranslational modifications to the 
protein, such as glycosylation and/or proteolysis. Treatment 
of the PNGase F (Oxford Glycosciences) according to the 
manufacturer's directions resulted in a single band of 
increased mobility, indicating that two original bands 
contain N-llnked carbohydrate, when run in the absence of a 
reducing agent, the relative mobility of the immunoreactive 
bands was greater than lOO kDa relative to the same markers 
indicating that HKHGl:flag fusion proteins may be a disulfide 
linked dimer or higher oligomer. 

25 

"•2' g^CPREflBION OP Hronvw TONGl-Vi.iyT.a« 

A human HKNGl-Vl flag epitope-tagged protein (HKNGl- 
VI: flag) vector was also constructed by PCR followed by 
ligation into an expression vector, pMET stop. The full 
^- length open-reading frame of the HKNGl-Vl cDNA sequence (SEQ 
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ID NO: 6) was PGR amplified using the following primer 
sequences : 

5« primer SJ^-TOTTTCTGAATTCACCATGAGG^^^ (SEQ ID NO -54) 

ACTACAGTAAC-3 ' 

3' primer 5«-TTTTTGTCGACTTATCACTTGTC6TCGTC (SEQ ID NO- 53 ^ 
5 GTCCTTGTAGTCCCAGGTTTTAAAATGTTCCT w -i" 

TAAAATGC-3 • 

The 5' primer incorporated a Kozak sequence upstream of and 
including the upstream initiator methionine. The 3« primer 
included the nucleotide sequence encoding the flag epitope 
DYKDDDDK (SEQ ID N0:50) followed by a termination codon. 

The sequenced DNA construct was transiently transfected 
into HEK 293T cells in 150 mm plates using Lipofectamine 
(GIBCO/BRL) according to the manufacturer's protocol. 
Seventy-two hours post-transfection, the serum-free * 
conditioned medium (OptiMEM, GIBCO/BRL) was harvested and 
spun and the remaining monolayer of cells was lysed using 
2 mL of lysis buffer £50 mH Tris pH 8.0, 150 mM NaCl, i% 
NP-40, 0.05% SDS with "Complete" protease cocktail 
(Boehringer Hannheim) diluted according to manufacturers 
instructions] . Insoluble material was pelleted before 
preparation of SDS-PAGE samples. 

Conditioned mediinn was electroblotted onto a PVDP 
membrane (Novex) after separation by SDS-PAGE on 4-20% 
gradient gels and probed with an M2 anti-flag polyclonal 
antibody (1:500, Sigma) followed by horseradish peroxidase 
(HRP) conjugated sheep anti-mouse antibody (l:5000, 
Amersham) , developed using cheml luminescent reagents 
(Renaissance, Dupont) , and exposed to autoradiography film 
(Biomax MR2 film, Kodak). Flag immunoreactivity appeared as 
a doublet of bands that migrated by SDS-PAGE between 60 and 
95 kDa as determined by Multimark molecular weight markers 
(Novex), demonstrating secretion of the HKN6l:Plag protein. 
When run in the absence of reducing agent, the relative 
mobility of the immunoreactive bands was greater than 100 kDA 
30 "Native to the same markers, suggesting that the HKMGi- 
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VI I flag fusion protein may be a disulfide linked dimer or 
higher oligomer. 



"•3' EXPREggl QH OP HnM>N HKNgUP^ 

^ A human HKNGl/hlgGlFc fusion protein vector was 

constructed by PCR. The full-length open-reading frame of 
the full length HKNGl cDNA (SEQ ID NO: 5) was PCR amplified 
using the following primer sequences: 

5' primer 5'-iTT^ 

CAGAGAAAAACG-3 • wu.odj 

3' primer 5 • -TTTTTGGATCCGCTGCTGCCCAGGTTTTA tsvn rn iift.K«\ 
" AAATGTTCCTTAAAATGC-3 • "^"^"-^^^ (SEQ ID NO: 56) 

The 5- primer incorporated a Kozak sequence before the 
upstream methionine to the amino acid residue before the stop 
codon. The 3 • PGR primer contained a 3 alanine linker at the 
Dunction of HKNGl and the human igGl Fc domain, which starts 
IS at residues DPE. The genomic sequence of the human IgGl Fc 
domain was ligated along with the PCR product into a pCDM8 
vector (invitrogen, Carlsbad CA) for transient expression. 

The sequenced DNA construct was transiently transfected 
into HEK 293T cells in 150 mm plates using Lipofectamine 
(GIBCO/BRL) according to the manufacturer's protocol. 
20 Seventy-two hours post-transfection, the serum-free 

conditioned medium (OptiMEM, GIBCO/BRL) was harvested and 
spun and the remaining monolayer of cells was lysed using 
2 mL of lysis buffer [50 mM Tris pH 8.0, 150 mM NaCl, 1% 
NP-40, 0.05% SDS with -complete" protease cocktail 
(Boehringer Mannheim) diluted according to manufacturers 
25 instructions] . insoluble material was pelleted before 
preparation of SDS-PAGE samples. 

conditioned medium was electroblotted onto a PVDF 
membrane (Novex) after separation by SDS-PAGE on 4-20% 
gradient gels and probed with an M2 anti-Fc polyclonal 
antibody (1:500, Jackson ImmunoResearch Laboratories, inc.) 
30 followed by horseradish peroxidase (HRP) conjugated sheep 
anti-mouse antibody (l:5000, Amersham) , developed using 
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chemiluinxnesc nt reagents (Renaissance, Dupont) , and exposed 
to autoradiography f ii^n (Biomax MR2 film, Kodak) . Human IgGl 
Fc imunoreactivity appeared as a doublet of bands that 
mxgrated by SDS-page between 148 and 60 kDa standards of the 
Multimark molecular weight markers (Movex) , demonstrating 
» secretion of the HKNGl:Pc fusion protein. 

EXPRESSTQW OP mmnv HKNei-vi.g., 
A human HKNGl-Vl/hlgGlFc fusion protein (HKNGl-Vl-Fc) 
vector was also constructed by PCR. The full-length open 
reading frame of HKNGl-Vl cDNA (SEQ ID NO: 6) was PGR 
amplified using the following primer sequences: 

^;iJ^^3-?A^^CCATGAGGACCTGGG (SEQ ID N0:57, 

(SEQ IDNO:56, 

15 The 5- primer incorporated a Kozak sequence before the 

upstream methionine to the amino acid residue before the stop 
codon. The 3. PGR primer contained a 3 alanine linker at the 
junction of HKNGl-vi and the human IgGl Fc domain, which 
starts at residues DPE. The genomic sequence of the human 
IgGl FC domain was ligated along with the PGR product into a 
20 PCDM8 vector for transient expression. 

The sequenced DNA construct was transiently transfected 
into HEK 293T cells in 150 mm plates using Lipofectamine 
(GIBCO/BRL) according to the manufacturer's protocol. 
Seventy-two hours post-transfection, the serum-free 
conditioned medium (OptiMEM, GIBCO/BRL) was harvested and 
25 spun and the remaining monolayer of cells was lysed using 
2 mL of lysis buffer [50 mH Tris pH 8.0, 150 mM NaCl, 1% 
NP-40, 0.05% SDS with -Gomplete" protease cocktail 
(Boehringer Mannheim) diluted according to manufacturers 
instructions], insoluble material was pelleted before 
preparation of SDS-PAGE samples. 
30 Conditioned medium was electroblotted onto a PVDF 

membrane (Novex) after separation by SDS-PAGE on 4-20% 
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gradient gels and probed with an anti-human Fc polyclonal 
antibody (1:500, Jackson lanunoResearch Laboratories, Inc.) 
followed by horseradish peroxidase (HRP) conjugated sheep 
anti-mouse antibody (1:5000, Amersham) , developed using 
chemiluminescent reagents (Renaissance, Dupont) , and exposed 
to autoradiography film (Biomax MR2 film, Kodak) . Human IgGl 
Fc immunoreactivity appeared as a doublet of bands that 
migrated by SDS-PAGE between 148 and 60 kDa standards of the 
Multimark molecular weight markers (Novex) centered 
approximately between 125 and 150 kDa, demonstrating 
secretion mediated by the HKNGl signal peptide. 

10 

A human HKNGlA7ihIgGlFc fusion protein vector was also 
constructed by PCR. The sequence of the HKNG1A7 splice 
variant was amplified by PCR amplification using Exons 1 
through 6 of the full length HKNGl cDNA sequence (SEQ ID 
NO:l) as a template with the following primer sequences: 

^' Vc^^^^'^'^'''^'^'^^^^^<'^^ (SEQ ID N0:58) 

3« primer 5^-TTTTTG6ATCC6CTGCGGCCTCCGTG (SEQ ID NO: 59) 

GTCAG6AGCTTATTTTTCACA6AGGACCAGCT ' 
AG-3 • 



20 



25 



The 5» primer incorporated a Kozak sequence upstream of and 
including the upsream initiator methionine. The 3 • primer 
included the firstly (coding) nucleotides of exon 8 followed 
by nucleotides encodiing a 3 alanine linker. 

The genomic sequence of the human IgGl Fc domain was 
ligated along with the PCR product into a pCDMS vector for 
transient expression. 

The sequenced DNA construct was transiently transfected 
into HEK 293T cells in 150 mm plates using Lipofectamine 
(6IBC0/BRL) according to the manufacturer's protocol. 
Seventy-two hours post-transf ection , the serum-free 
^- conditioned medium (OptiMEM, GIBCO/BRL) was harvested and 
spun and the remaining monolayer of cells was lysed using 
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2 mL Of lysis buffer [50 m Tris pH 8.0, iso HaCl i% 
NP-40, 0.05% SDS with -Complete., protease cocktail ' 
(Boehringer Mannheim) diluted according to manufacturers 
instructions], insoluble material was pelleted before 
preparation of SDS-page samples. 

conditioned medium was electroblotted onto a pvdf 
membrane (Movex) after separation by SDS-PAGE on 4-20% 
gradient gels and probed with an anti-human Fc polyclonal 
antibody (i:500, Jackson ImmunoResearch Laboratories) 
followed by horseradish peroxidase (HRP) conjugated sheep 

10 ^!!:r" "^"^""^ ^--^-)' <*eveloped using 

chemxlumxnescent reagents (Renaissance, Dupont, , and exposed 
to autoradiography film (Biomax MR2 film, Kodak, . Hum^ igci 
PC xmmunoreactivity appeared as a band that migrated by SDS- 
PAGE between 42 and 60 kDa relative to Multimark moleJ[lar 
Z^sl TIT' "'"'"^^ approximately between 36.5 

« TovZ') , '° '^'^^ " molecular weight markers 

PTOESSIbW OP ^Tnn> tnjMMi mrwnn 
A human.HKNGi expression vector was constructed by pgr 
amplxfxcatxon of the human HKNGl cDNA sequence (SEQ id 110:1) 
followed by ligation into an expression vector, pcDNA3.i 
(invitrogen, Carlsbad CA). The full open-reading frame of 

HKHGl cDNA sequence (SEQ ID NO:5) was PCR amplified using 
the following primer sequences: 

5. primer 5-TOctCTCGAGGACTACAGGACACAGC (SEQ ID NO: 60) 

3- primer 5'-TTTTTGGATCCTTATCACCAGGTTTTAAA /SEOTnMn.<,x 
ATGTTCCrTAAAATGC-3 • ^"^^^^^^^ (SEQ ID NO: 61) 

The 5. prijner incorporated a Kozak sequence upstream of and 
including the upstream initiator methionine. The 3* primer 
included a tandem pair of termination codons. 

The sequenced DNA construct was transiently transf ected 
into EEK 293T cells in 150 mm plates using Lipofectamine 
30 (GIBCO/brl) according to the manufacturer's protocol 
seventy-two hours post-transfection, the serum-free 
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conditioned medium (OptiMEM, GIBCO/BRL) was harvested and 
spun and the remaining monolayer of cells was lysed using 
2 mL of lysis buffer [50 mM Tris pH 8.0, 150 mM NaCl, i% 
NP-40, 0.05% SDS with "complete" protease cocktail 
(Boehringer Mannheim) diluted according to manufacturers 
* instructions] . Insoluble material was pelleted before 
preparation of SDS-PAGE samples. 

Conditioned medium was electroblotted onto a PVDF 
membrane (Novex) after separation by SDS-PAGE on 4-20% 
gradient gels and probed with an anti-HKNGl polyclonal 
antibody (#84, 1:500) followed by horseradish peroxidase 
(HRP) conjugated sheep anti-mouse antibody (1:5000, 
Amersham) , developed using chemiluminescent reagents 
(Renaissance, Dupont) , and exposed to autoradiography film 
(Biomax MR2 film, Kodak) . HKNGl immunoreactivity appeared as 
a doublet of bands that migrated by SDS-PAGE between 60 and 
95 kDa as determined by Hultimark molecular weight markers 
" (Kovex). 

"•7. EXPREaSI OK OF WATIVE HUMAN HKWai-Vl 

A human HKMGl-Vl expression vector was also constructed 
by PCR amplification of the human HKNGl-Vl cDNA seguence (SEQ 
ID NO: 3) followed by ligation into an expression vector, 
PCDKA3.1. The full open-reading frame of the HKNGl cDNA 
sequence (SEQ ID NO: 6) was PGR amplified using the following 
primer sequences: 

5* primer 5 • -TTTTTCTGAATTCACCATGAAGCCGCCAC (SEQ ID NO: 62) 
TCTTGGTG-3 • 

5* primer 5 • -TTTTTCTCTCGAGACCATGAGGACCTGG6 (SEQ ID NO: 63) 
^* ACTACAGTAAC-3 • «w.oj; 

3« primer 5 • -TTTTTGGATCCTTATCACCAGGTTTTAAA (SEQ ID NO: 61) 
AT6TTCCTTAAAATGC-3 • ' 

The 5' primer incorporated a Kozak sequence upstream of and 
including the upstream initiator methionine. The 3' primer 

_ included a tandem pair of termination codons. 

30 The sequenced DNA construct was transiently transfected 

into HEK 293T cells in 150 mm plates using Lipofectamine 
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(GIBCO/BRL) according to the manuf actxirer • s protocol. 
Seventy-two hours post-transfection, the serum-free 
conditioned medium (OptiMEM, GIBCO/BRL) was harvested and 
spun and the remaining monolayer of cells was lysed using 
2 mL of lysis buffer [50 mM Tris pH 8.0, 150 mM NaCl, 1% 
5 NP-40, 0.05% SDS with "Complete" protease cocktail 

(Boehringer Mannheim) diluted accordi.ig to manufacturers 
instructions]. Insoluble material was pelleted before 
preparation of SDS-PAGE samples. 

Conditioned medium was electroblotted onto a PVDF 
membrane (Novex) after separation by SDS-PAGE on 4-20% 

^0 gradient gels and probed with an M2 anti-HKNGl polyclonal 
antibody (#84, 1:500) followed by horseradish peroxidase 
(HRP) conjugated sheep anti-mouse antibody (l:5000, 
Amersham) , developed using chemiluminescent reagents 
(Renaissance, Dupont) , and exposed to autoradiography film 
(Biomax MR2 film, Kodak) . HKMG immunoreactivity appeared as 

15 a doublet of bands that migrated by SDS-PAGE between 70 and 
95 kDa as determined by Multimark molecular weight markers 
(Novex) , demonstrating secretion mediated by the HKNGl signal 
peptide. 



20 



25 



30 



11. «. BXPRBflSIQK OP HTOtA N HKNS8AP ggSIOM PRp TBTWfl 

Expression vectors were also constructed for human HKNGl 
alkaline phosphatase C-terminal fusion protein (HKNGl :AP) , 
human HKNGl-Vl alkaline phosphatase C-terminal fusion protein 
(HKNG1-V1:AP) , and human HKNGl alkaline phosphatase N- 
terminal fusion protein (AP: HKNGl) . 

The expression vector for human HKNGl :AP was constructed 
by PCR amplification followed by ligation into a vector for 
suitable for expression in HEK 293T cells. The full-length 
open-reading frame of human HKNGl (SEQ ID NO: 5) was PCR 
amplified using a 5« primer incorporating an EcoRl 
restriction site followed by a Kozak sequence prior to the 
upstream initiator methionine. The 3' primer included a Xhol 
restriction site immediately following the final codon of 
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HKNGl. Thus, the open reading frame of the construct 
Includes the HKNGl signal peptide and the full HKNGl sequence 
followed by the full sequence of human placental alkaline 
phosphatase. 

^ The expression vector for human HKNGl-vi:AP was 

constructed by PCR amplification followed by ligation into 
PMEAP3 vector. The full length open reading frame of human 
HKNGl-Vl (SEQ ID N0:6) was PCR amplified using a 5« primer 
incorporating an EcoRI restriction site followed by a Kozak 
sequence prior to the upstream initiator methionine. ThB 3' 
primer included a Xhol restriction site immediately following 
the final codon of SKNGl-^vi. Thus, the open reading frame of 
the construct includes the HKNGl-Vl signal and the full 
length HKNGl-Vl sequence followed by the full sequence of 
human placental alkaline phosphatase. 

The expression vector for human AP:HKN6l was constructed 
15 by PCR amplification followed by ligation into the AP-Tag3 
vector reported by Cheng and Flanagan, 1994, cell 7P: 157-168. 
The full-length open-reading frame of human HKHGi (SEQ id 
NO: 5) was PCR amplified using a 5- primer incorporating a 
BamHI restriction site prior to the nucleotides encoding the 
first amino acids (i.e., APT) of the mature HKN6 protein, and 
20 a 3- primer that included a xhol restriction site immediately 
following the termination codon of HKNGl. Thus, the open 
reading frame of the complete construct includes the AP 
signal peptide and the full sequence of human placental 
alkaline phosphatase, followed by the full HKNGl sequence. 
The sequenced DNA constructs were transiently 
25 transfected in HEK 293T cells in 150 m plates using 

Lipofectamine (GIBCO/BHL) according to the manufacturer's 
protocol. 72 hours post-transfection, the serum-free 
conditioned media (OptlMEM, Gibco/BRL) were harvested, spun 
and filtered. Alkaline phosphatase activity in the 
- conditioned media was quantitated using an enzymatic assay 
kxt (Phospha-Ught, Tropix) according to the manufacturer's 
instructions. When alkaline phosphatase fusion prot in 
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cone ntrations below 2 nM were observed, conditioned medium 
was concentrated by centrifugation using a 30 kDa cut-off 
membrane. Conditioned medium samples before and after 
concentration were analyzed by SDS-PAGE followed by Western 
blot using anti-human alkaline phosphatase antibodies (1:250, 
5 Genzyme) and chemiluminsecent detection. A band at 140 kDa 
was observed in concentrated supernatant of HKNGl:AP, HKNGi- 
Vl:AP, and AP:HKH61 transfections. Conditioned medium 
samples were adjusted to 10% fetal calf serum and stored at 
4ac. 

" 11.9. PURrglCA TION OF glAG-gAfiggp HKNSl PROTEINS 

a?he secreted flag-tagged proteins described in 
subsections 12. l and 12.2 above were isolated by a one step 
purification scheme utilizing the affinity of the flag 
epitope to M2 anti-flag antibodies. The conditioned media 
was passed over an M2-biotin (Sigma) /streptavidin Poros 
column (2.1 X 30 mm, PE Biosystems) . The column was then 
washed with PBS, pH 7.4, and flag-tagged protein was eluted 
with 200 mM glycine, pH 3.0. Fraction was neutralized with 
1.0 M Tris pH 8.0. Eluted fractions with 280 nm absorbance 
greater than background were then analyzed on SOS-PAGE gels 
and by Western blot. The fractions containing flag taged 
protein were pooled and dialyzed In 8000 MWCO dialysis tubing 
against 2 changes of 4L PBS, pH 7.4 at 4oc with constant 
stirring. The buffered exchanged material was then sterile 
filtered (0.2 /tm, Millipore) and frozen at -80flC. 



20 



25 



3-1-10, PDRIPieA TION OF HKKGl Ve PPfllON PROTBIWg 

The secreted Fc fusion proteins described in Subsections 
12.3-12.5 above were isolated by a one step purification 
scheme utilizing the affinity of the human IgGi Fc domain to 
Protein A. The conditioned media was passed over a POROS A 
column (4.6 X 100 mm, PerSeptive Biosystems); the column was 
30 washed with PBS, pH 7,4 and eluted with 200 mM glycine, 

pH 3.0. Fractions were neutralized with 1.0 M Tris pH 8.0. 
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A constant flow rate of 7 ml/min was maintained throughout 
the procedure. Eluted fractions with 280 nm absorbance 
greater than background were then analyzed on SDS-PAGE gels 
and by Western blot. The fractions containing Fc fusion 
protein were pooled and dialyzed in 8000 MWCO dialysis tubing 
5 against 2 changes of 4L PBS, pH 7.4 at 4ac with constant 
stirring. The buffered exchanged material was then sterile 
filtered (0.2 ^m, Millipore) and frozen at -8oac. 

PRODPCTT ON OP a3TOI..HKKgl ANTIBODTBS 

Described in the example presented in this Section is 
the production and characterization of polyclonal and 
monoclonal antibodies directed against HKNGl proteins. 

12.1. PRODPCTI ON OP POI.YCLOMM. MnPIBODTEfi 

Polyclonal antisera were raised in rabbits against each 
of the three peptides listed in Table 3 below. Each of the 
peptides was derived from the HKNGl amino acid sequence (SEQ 
ID NO: 2) by standard techniques (see, in particular, 
HarlowSLane, 1988, Antibodies: A Laboratory Manaal, Cold 
Spring Harbor Laboratory Press, the contents of which is 
incorporated herein by reference in its entirety) . Each of 
20 the peptides is also represented in the HKNGl-Vl polypeptide 
sequence (SEQ ID NO: 4). Antisera was subsequently affinity 
purified using the peptide immunogens. 



TABLE 3 



25 



Antibody 


Peptide/ iDnatmogen 


a. a. residues 






(SEQ ID NO: 2) 


Antibody 84 


APTWKDKTAISENLK 


50-64 


Antibody 85 


KAIEDLPKQDK 


304-314 


Antibody 86 


KALQHFKEHFKTW 


483-495 



PRODPCTrON OP MONOGLOMM, AMTIBODIEfl 
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10 



15 



Monoclonal antibodies were raised in mice by standard 
techniques (see, Harlow S Lane, supra) against the HKN(5-Pc 
fusion protein described in Section 11.3 above. Wells were 
screened by ELISA for binding to the HKNG-Fc fusion protein 
Those wells reacting with the Fc protein were identified by 
ELISA for binding to an irrelevant Fc fusion protein and 
discarded. HKNG-Fc specif ic wells were tested for their 
ability to innnunoprecipitate HKNG-Fc and subjected to isotype 
analysis by standard techniques (Harlow & Lane, supra) , and 
eight wells were selected for subcloning. The isotype of the 

subcloned monoclonal antibodies was confirmed and is 

presented in Table 4 below. 

Based on Western blotting, immunoprecipitation and 

immunostaining data discussed in Subsection 12.3 below two 

monoclonal antibodies (3D17 and 4N6) were selected for large 

scale production. 

TABLE 4 



20 



25 



Clone 


Isotype 


1F24 


2a 


1J18 


2a 


2020 


1 


3D17 


1 


3D24 


2a 


4N6 


1 


|4016 


2b 


|lOC6 


2a 



12.3. WBSTBRH BLOT!riN6 AND IMMOKOPRECIPIKLTION 

OF REGQ HBIMAMT HRKG PRCypii^ TW 

The polyclonal antisera and all eight monoclonal 
. antibodies described in subsection 12. i and 12.2 above were 
30 tested for their ability to recognize recombinant HKNGl 

proteins on Western blots using standard techniques (see, in 
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particular, Harlow & Lane, 1988, Antibodies: A Laboratory 
Manual, Cold Spring Harbor Laboratory Press). Polyclonal 
antisera 84 and 85 and monoclonal antibodies 3D17 and 4N6 
were able to recognize all forms of the mature (i.e., 
5 secreted) recombinant HKNG proteins tested (i.e., HKNGlzFc, 
HKNGl:flag, AP:HKNGl, and native HKNGl) in Western blots. 

Table 5 indicates the ability of each monoclonal 
antibody to immunoprecipitate recombinant HKNGl, as assessed 
by Western blotting of immunoprecipitates with the polyclonal 
antisera 84 and 85. None of the polyclonal antisera were 
10 able to immunoprecipitate recombinant HKNGl proteins. All 
eight monoclonal antibodies immunoprecipitated HKNGl :Fc. 
Immunoprecipitation of the other recombinant HKNGl proteins 
was variable. 



TABLE 5 





Monoclonal 


Protein 




Antibody 


HKNGl :Fc 


HKNGl : flag AP : HKNGl 


HKNGl 
(native) 




IF24 


+ 


+ + 


-/+ 


20 


1J18 


+ 


-/+ 


+/+ 




202 0 




- + 






3D17 


+/+ 


+/+ 


+/+ 




3D24 


+ 








4N6 


+ 


+ + 


+ 


25 


4016 


+ 








10C6 


+ 




+ 



13. CONFIRUaTION OF THE HXNQ N-^FERMIKUS 
AND DlfiULPIDE BOND BTRUCTURE 

30 The exp riments described in this section provide data 

identifying the N-terminus of the mature secreted human HKNG 
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protein. The experiments also provide data identifying the 
disulfide bond linkages between cysteine amino acid residues 
in the mature, secreted protein. 

Specifically, mature, secreted HKNG:flag, HKNG, and 
HKNG:Fc recombinant proteins were produced and purified as 
S described in Section 11 above. The mature recombinant 
proteins were digested with trypsin, and the tryptic 
fragments were identified and sequenced using reverse-phase 
liquid chromatography coupled with electrospray ionization 
tandem mass spectrometry (LC/MS/MS) . The N-terminus of all 
mature secreted proteins tested was unambiguously identified 
10 as APTWKDKT, which corresponds to the amino acid sequence 
starting at alanine 50 of the HKNGl amino acid sequence (PIG. 
l; SEQ ID NO: 2) or alanine 32 of the HKNGl-vi amino acid 
sequence (PIG. 2; SEQ ID N0:4). Thus, although the cDNA 
sequences of HKNGl and HKNGl-Vl encode distinct amino acid 
sequences, the mature secreted proteins produced by these two 
splice variants of the human HKNGl gene are identical, since 
the alternative splicing that gives rise to HKNGl-Vl (i.e., 
the deletion of exon 3) affects the amino acid sequence of 
the proteolytically cleaved signal peptide. The amino acid 
sequence of the mature secreted HKNGl protein is shown in 
2P FIG. 17 (SEQ ID NO: 51) 

The mature secreted HKNG protein is also distinct from 
the RPP amino acid sequence disclosed by Shimizu-Matsumo et 
al. (1997, Invest. Ophthalmal. vis. Sci. 38:2576-2585). 
in particular, amino acid residues i to 20 of the RPP amino 
acid sequence disclosed in Figure 3 of Shimizu-Matsumo et 
25 al., supra, correspond to the cleaved signal peptide of 
HKNGl-Vl. The amino acid sequence of the mature secreted 
form of the EKNGl gene product is depicted in FIG. 17 (SEQ 
ID NO: 51). 

Disulfide bond linkages for 8 of the 13 cysteine 
3^ residues in the mature, secreted HKNG protein were also 
identified from LC/MS/MS of peptides recovered from tryptic 
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5 



10 



digestion of the unreduced proteih. In particular, the 
following disulfide bonded pairs of cysteines were identified 
(numbering refers to the HKNGl protein shown in FIG. i; SEQ 
ID NO: 2): 

Cys 134 to Cys 145; 
Cys 148 to Cys 153; 
Cys 160 to Cys 334; and 
Cys 354 to Cys 362. 

14. RX^MP LE; LOCALIZATION OP HKNG mRNA 
MID PROT HIH EXPRESBIQK 

This example describes experiments wherein the HKNG gene 
product is shown to be expressed in human brain and retinal 
tissue, specifically, in situ hybridization experiments 
performed using standard techniques with a probe that 
corresponded to the complementary sequence of base pairs 9io- 
1422 of the full length BKNGl cDNA sequence (SEQ ID N0:1) 
detected HKNG messenger RNA in the photoreceptor layer (outer 
nuclear layer) of human retina in eyes obtained from the New 
England Eye Bzmk. 

The polyclonal antisera and all eight monoclonal 
antibodies described in Section 12 above were tested for 
immunostaining of human retina. Polyclonal antiserum 85 and 
monoclonal antibodies 1F24, 4N6 and 4016 showed 
immunostaining of HKNG protein in the photoreceptor layer and 
adjacent layers of the retina. The immunostaining in these 
tissues with polyclonal antiserum was blocked by 85 peptide 
immunogen, but not by the other two peptide immunogens (i.e., 
25 84 and 86), confirming that the immunostaining was due to 
HKNG protein expressed in the photoreceptor layer. 

The same antibodies were then used to localize HKNG 
protein by immunostaining in sections of human and monkey 
brain. HKNG protein was observed in cortical neurons in the 
_ frontal cortex. The majority of pyramidal neurons in layers 
30 IV-V were immunoreactive for HKNG protein. A subpopulation 
of neurons was also labeled in layers l-lii. HKNG 



- 130 - 



wo 99/47535 



PCT/US99/05606 



imnunoreactivity was also observed in the pyramidal cell 

layer of the hippocampus and in a small number of neurons in 

the striatum. 

These data further support the fact that HKNG is, 

indeed, a gene which mediates neuropsychiatric disorders such 
5 as BAD. Furthermore, the fact that HKNG is also expressed in 
human retinal tissue suggests that the gene also plays a role 

in myopia conditions. Specifically, Young et al. (1998, 
American Journal of Human Genetics 53:109-119) report a 
strong linkage (LOD =9.59) for primary myopia and secondary 
macular degeneration and retinal detachment in the telomeric 
region of human chromosome I8p. Through fine mapping 
analysis, this candidate region has been narrowed to a 7.6 cM 
haplotype flanked by markers D18S59 and D18S1138 (Young et 
al., supra). However, the marker D18S59 lies within the 
HKMGl gene. This fact, coupled with the finding the HKNG is 
15 expressed in high levels in the retina, strongly suggests 
that the HKNGl gene is also responsible for human myopia 
conditions and/or other eye related diseases such as primary 
myopia, secondary macular degeneration, and retinal 
detachment. 

20 15. EZ&MPLEt ZMM&TDXIE PROTEIN PRODUCTS OF 

THE HKNGl cDHA SEOUBNGES 

This section describes experiments which were performed 
to determine which of the two putative initiator methionines 
encoded by both the full length HKNGl cDNA and the 

alternatively spliced HXNGl-Vl cDNA are used in the synthesis 
25 ^ . 

of immature HKNGl protein. The results indicate that both 
initiator methionines are used at varying levels, resulting 
in the production of three different forms of the immature 
HKNGl protein, referred to herein as immature protein form i 
(IPFl), immature protein form 2 (IPF2) , and immature protein 

- form 3 (IPP3). 
30 



- 131 - 



wo 99/47535 



PCT/US99/05606 



Both the full length HKNGl cDNA sequence shown in FIG. 1 
(SEQ ID NO:i) and the alternatively spliced SKNGl-Vl cDMA 
sequence shown in FIG. 2 (SEQ ID NO: 3) encode predicted 
proteins that have methionines in close proximity to their 
g predicted initiator methionines. The predicted protein 

sequence encoded by the full length SKNGl cDNA sequence has a 
second methionine at amino acid residue number 30 of the 
amino acid sequence depicted in FIG. i (SEQ ID K0:2). Thus, 
although FIG. l indicates that the full length HKNGl cDNA 
encodes the first immature form of the HKNGl protein depicted 
10 in FIG. 1 (referred to herein as IPFI) , the full length HKirGl 
CDNA may additionally encode a second immature protein form 
preferred to herein as IPF2) , whose sequence (SEQ ID NO: 64) 
is provided on the third line of the protein alignment 
depicted in FIG. 17. IPF2 is initiated at methionine 30 of 
the IPFI protein sequence, and is identical to the RPP 
polypeptide sequence taught by Shimizu-Matsumoto et al (1997, 
Invest. Ophthalmol, vis. Sci. 3fl:2576-2585) . Likewise, the 
alternatively spliced HKNGl-Vl cDNA sequence encodes the 
predicted immature protein form, referred to herein as IPF3, 
depicted in FIG. 2 (SEQ ID NO: 4). However, the HKNGl-Vl cDNA 
20 may also encoded another immature protein form, identical to 
IPF 2, that is initiated at methionine 12 of the IPF3 protein 
sequence. FIG. 17 illustrates an alignment of the three 
immature HKNGl protein sequences IPFI (second rov) , IPF2 - 
(third row) , and IPF3 (bottom row) . As explained is Section 
13 above, the mature HKNGl gene product secreted by cells 
25 expressing the HKNGl constructs described in Section 11, 
above, is in fact the same cleaved product (SEQ ID NO: 51), 
regardless of the immature HKNGl protein (IPFI, IPF2, or 
IPF3) from which it is produced. An alignment of the mature 
secreted HKNGl protein is therefore also depicted in FIG. 17 
(top row) . 

Modified HKNGl: flag and HKNGl-Vl: flag expression vectors 
were constructed as described in Sections 12.1 and 12.2, 
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respectively./ However, the nucleotide sequence of full 
length HKNGl was laodified, using standard site directed 
mutagenesis techniques, so as to introduce an additional base 
pair between the upstream methionine (i.e., met 1 in SEQ ID 
^ MO: 2) and the downstream methionine (i.e., met 30 in SEQ ID 
MO: 2). The nucleotide sequence of HKMGl-Vl was likewise 
modified, using standeurd site directed mutagenesis 
techniques, to introduce an additional base between its 
upstream methionine (i.e., met 1 in SEQ ID NO: 4) and 
downstream methionine (i.e., met 12 in SEQ ID NO: 4). Thus, 

10 in both modified constructs, the C-terminal flag epitope tag 
was no longer in the same reading frcune as the upstream 
methionine but was in frame with the downstream methionine. 
Consequently, exclusive translation initiation at the first 
methionine of a construct would lead to the production of 
non-flag immunoreactive proteins. However, exclusive 

15 translation initiation at the second methionine of a 

construct would lead to the production of flag immunoreactive 
proteins. 

Unmodified HKNGl: flag, unmodified HKMG1-Vl:flag, 
modified HKNGl: flag, and modified HKNGl~Vl:flag constructs 
were tremsfected into cells, and their resulting gene 

20 products were harvested, blotted onto a PVDF membrane, and 
probed with an M2 anti-flag polyclonal antibody, and 
developed according to the methods described in Sections 12.1 
and 12*2 above. 

Flag immunoreactivity was detected in all four S2unples. 
The unmodified HKNGl: flag and HKNG1-Vl:f lag expression 

25 vectors produced amounts of matxire secreted HKNGl: flag 

protein consistent with the levels detected in Sections 12.1 
and 12.2 above. Further, the flag immunoreactive band 
detected for the modified HKNGl: flag construct was 
indistinguisheODle in intensity from the hand detected for the 

. unmodified HKNGl: flag construct, indicating that the immatxare 

30 HKNGl protein produced by. full length HKNGl cDNA is 
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predominantly IPF2, while IPFl is produced by full length 
SKNGl cDNA in r latively minor amounts. 

The flag immunoreactive band from the modified HKNGi- 
VI: flag construct had dramatically reduced intensity relative 
to the band from the unmodified HKNG1-Vl:flag construct. 
Thus, HKNGl-Vl produces primarily the immature HKN61 protein 
IPF3, while the immature HKNGl protein IPF2 is produced by 
EKNGl-Vl in relatively minor amounts. These results are 
summarized below in Table 6. 



10 TABLE 6 



15 



Construct 


Immature Protein 


Promincince 


HKNGl 


IPFl (SEQ ID NO: 2) 


Minor 




IPF2 (SEQ ID NO: 64) 


Predominant 


HKNGl-Vl 


IPF2 (SEQ ID N0:64) 


Minor 




IPF3 (SEQ ID NO: 4) 


Predominant 



Thus, the HKNGl gene products of the invention include 
gene products corresponding to the immature protein forms 
IPFl and IPF3. However, preferably the HKNGl gene products 
of the invention do not include zuaino acid sequences 
consisting of the IPF2 sequence (SEQ ID NO: 64). 



16 • KBFEREHCBS CITED 

The present invention is not to be limited in scope by 
the specific embodiments described herein, which are intended 
2j as single Illustrations of individual aspects of the 

invention, and functionally equivalent methods and components 
are within the scope of the invention. Indeed, various 
modifications of the invention, in addition to those shown 
and described herein will become apparent to those skilled in 
the art from the foregoing description and accompanying 

- drawings . 
30 
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All publications, patents, and patent applications 
mentioned in this specification are herein incorporated by 
reference to the sane extent as if each individual 
publication or patent application was specifically and 
individually indicated to be incorporated by reference. 
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15 



WHAT IS CIA7 MT?n Tg> 

1. An isolated nucleic acid molecule comprising a 
nucleotide sequence that encodes a HKNGl gene product 
comprising: 

(a) the amino acid sequence of SEQ ID NO: 2; 

(b) the amino acid sequence of SEQ ID NO: 4; 

(c) the amino acid sequence of SEQ ID NO:39; 

(d) the amino acid sequence of SEQ ID N0:4i; 

(e) the amino acid sequence of SEQ ID NO: 43; 

(f) the amino acid sequence of SEQ ID NO: 45; 

(g) the amino acid sequence of SEQ ID NO: 49; or 

(h) the amino acid sequence of SEQ ID NO: 66. 

2. The isolate nucleic acid molecule of Claim 1, 
wherein the isolate nucleic acid molecule comprises: 

(a) the nucleotide sequence of SEQ ID NO:l; 

(b) the nucleotide sequence of SEQ ID NO: 3; 

(c) the nucleotide sequence of SEQ ID NO: 7; 

(d) the nucleotide sequence of SEQ ID NO: 34; or 

(e) the nucleotide sequence of SEQ ID NO: 35. 

3. The isolated nucleic acid molecule of Claim 1, 
wherein the isolated nucleic acid molecule comprises: 

(a) the nudleotide sequence of SEQ ID NO: 38; 

(b) the nucleotide sequence of SEQ ID NO: 40; 

(c) the nucleotide sequence of SEQ ID NO: 42; or 

(d) the nucleotide sequence of SEQ ID NO: 44. 

4. The isolated nucleic acid molecule of Claim 1, 
25 wherein the isolated nucleic acid molecule comprises: 

(a) the nucleotide sequence of SEQ ID NO: 46; 

(b) the nucleotide sequence of SEQ ID NO: 47; or 

(c) the nucleotide sequence of SEQ ID NO: 48. 

5. An isolated nucleic acid molecule consisting of a 
^0 nucleotide sequence that encodes a mature HKNGl protein 

having the amino acid sequence of SEQ ID NO: 51. 
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6. An isolated nucleic acid molecule which hybridizes 
to the complement of the nucleic acid molecule of any one of 
claims 1-5 under highly stringent conditions comprising 
washing in 0.ixSSC/o.i% SDS at 68 ac. 

7. An isolated nucleic acid molecule which hybridizes 
to the complement of the nucleic acid molecule of any one of 
claims 1-5 under stringent conditions comprising washing in 
0.2XSSC/0.1% SDS at 50-65flC. 



8. The isolated nucleic acid molecule of claim 6 or 7 
10 wherein said isolated nucleic acid molecule encodes a 

functionally equivalent HKNGl gene product. 

9. A vector comprising the nucleotide sequence of any 



one of Claims 1-5. 



15 



10. An expression vector comprising the nucleotide 
sequence of any one of Claims 1-5 operatively associated with 
a regulatory nucleotide sequence controlling the expression 
of the nucleotide sequence in a host cell. 

11. A host cell genetically engineered to contain the 
nucleotide sequence of any one of Claims 1-5. 

12. A host cell genetically engineered to express the 
nucleotide sequence of any one of Claims 1-5 operatively 
associated with a regulatory nuclfeotide sequence controlling 
expression of the nucleotide sequence in said host cell. 

25 

13. An isolated polypeptide comprising the amino acid 
sequence of a HKNGl gene product having: 

(a) the amino acid sequence of SEQ ID NO: 2; 

(b) the amino acid sequence of SEQ id NO: 4; 

(c) the amino acid sequence of SEQ id NO: 39; 

(d) the amino acid sequence of SEQ ID NO: 41; 

(e) the amino acid sequence of SEQ ID NO:43; 
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(f) the amino acid sequence of SEQ id NO: 45; or 

(g) the amino acid sequence of SEQ ID NO:49; 

(h) the amino acid sequence of SEQ id NO: 66. 

14. An isolated polypeptide consisting of a mature 
^Gl gene product having the amino acid sequence of SEQ id 



15. An isolated polypeptide comprising an amino acid 
sequence encoded by the isolated nucleic acid molecule of 
Claim 6 or 7. 

10 

16. An antibody which selectively binds to the HKNGl 
.gene product of any one of claims 13 or 14. 

17. A method for treating a HKNGl-mediated disorder in 
an individual comprising administering to the individual a 
compound which modulates the expression of an HKNGl gene in 
the individual. 

18. The method of Claim 17, wherein the compound 
inhibits or potentiates the expression of an HKNGl gene in 
the Individual. 

20 

19. The method of Claim 17, wherein the compound is a 
small molecule. 

20. The method of Claim 17, wherein the HKNGl-mediated 
disorder is a neuropsychiatric disorder. 



25 



21. The method of Claim 17, wherein the 
neuropsychiatric disorder is bipolar affective disorder or 
schizophrenia. 

22. The method of Claim 17, wherein the HKNGl gene 
encodes a HKNGl gene product comprising: 

(a) the amino acid sequence of SEQ ID NO:2; 
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(b) 


the 


amino 


acid 


sequence 


of 


SEQ 


ID 


NO* A • 


(c) 


the 


amino 


acid 


sequence 


of 


SEQ 


ID 


NO* • 


(d) 


the 


amino 


acid 


seG[uence 


of 


SEQ 


ID 


NO • A 1 • 


(e) 


the 


amino 


acid 


sequence 


of 


SEQ 


ID 


NO: 43; 


(f) 


the amino 


acid 


sequence 


of 


SEQ 


ID 


NO:45; 


(g) 


the 


amino 


acid 


sequence 


of 


SEQ 


ID 


NO: 49; 


(h) 


the 


cimino 


acid 


sec[uence 


of 


SEQ 


ID 


NO: 51; 


(i) 


the 


amino 


acid 


sequence 


of 


SEQ 


ID 


NO: 64; or 


(j) 


the 


cunino 


acid 


sequence 


of 


SEQ 


ID 


NO: 66. 


23. 


The 


method 


i of Claim 17, 


wherein 


the individual is 



nannial. 



24. The method of Claim 23, wherein the mammal is a 
human. 



25. A method for treating a HKNGl-mediated disorder in 
an individual comprising administering to the individual a 
compound which modulates the e3q>ression or activity of a 
HKNGl gene product in the individual. 

26. The method of Claim 25, wherein the compound 
inhibits or potentiates the e3q>ression or activity of a HKNGl 

20 gene product in the individual. 

27. The method of Claim 25, wherein the compound is a 
small molecule. 

28. The method of Claim 25, wherein the HKNGl-mediated 
25 disorder is a neuropsychiatric disorder. 

29. The method of Claim 28, wherein the 
neuropsychiatric disorder is bipolar affective disorder or 
schiz ophrenia . 

30 

30 • The method of Claim 25, wherein the HKNGl gene 
product comprises: 
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(a) 
(b) 
(c) 
(d) 
(e) 
(f) 

(g) 

(h) 
(i) 
(j) 



the 
the 
the 
the 
the 
the 
the 
the 
the 
the 



amino 
amino 
amino 
amino 
amino 
eunino 
amino 
amino 
amino 
amino 



acid sequence 
acid sequence 
acid sequence 
acid sequence 
acid sequence 
acid sequence 
acid sec[uence 
acid sequence 
acid sequence 
acid sequence 



of SEQ ID NO: 2; 
of SEQ ID M0:4; 
of SEQ ID MO:39; 
of SEQ ID NO: 41; 
of SEQ ID NO:43; 
of SEQ ID NO:45; 
of SEQ ID NO:49; 
of SEQ ID N0:51; 
of SEQ ID NO: 64; or 
of SEQ ID NO: 66. 



" 31. The method of claim 25, wherein the individual is a 

mammal. 

32. The method of Claim 31, wherein the mammal is a 
human. 

*5 33. A method for identifying a compound which modulates 

expression of an HKNGl gene comprising: 



20 



(a) 
(b) 
(c) 



contacting a test compound to a cell that expresses 
an HKNGl gene; 

measuring a level of HKNGl gene expression in the 
cell ; 

comparing the level of HKNGl gene expression in the 
cell in the presence of the test compoiand to a 
level of HKNGl gene expression in the cell in the 
absence of the test compound, 
wherein if the level of HKNGl gene expression in the cell in 
the presence of the test compound differs from the level of 
25 ejcpression of the HKNGl gene in the cell in the absence of 
the test compound, a compound that modulates expression of an 
HKNGl gene is identified. 



34. The method of Claim 33, wherein the HKNGl gene 
encodes an HKNGl gene product comprising: 

(a) the amino acid sequ nee of SEQ ID NO: 2; 

(b) the amino acid sequence of SEQ ID NO: 4; 
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(c) 


the 


amino 


acid 


sequence 


of 


SEQ 


TD 

X Li/ 


Mn • 1 Q • 


(d) 


the 


amino 


acid 


sec[iience 


of 


SEQ 




Mn • At • 


(e) 


the 


amino 


acid 








JlU 


£%Ui 4 J J 


(f) 


the 


amino 


acid 


seG[uence 


of 


SEQ 


ID 


NO: 45; 


(g) 


the 


amino 


acid 


sec[uence 


of 


SEQ 


ID 


NO: 49; 


(h) 


the 


amino 


acid 


sequence 


of 


SEQ 


ID 


NO: 51; 


(i) 


the 


amino 


acid 


sequence 


of 


SEQ 


ID 


NO: 64; 


(j) 


the 


amino 


acid 


sequence 


of 


SEQ 


ID 


NO: 66. 



35. The 
comprises: 



method of Claim 34, wherein the HKNGl gene 



(a) the nucleotide 

(a) the nucleotide 

(a) the nucleotide 

(a) the nucleotide 

(a) the nucleotide 

(a) the nucleotide 

(a) the nucleotide 

(a) the nucleotide 

(a) the nucleotide 

(a) the nucleotide 

(a) the nucleotide 

(a) the nucleotide 

(a) the nucleotide 

(a) the nucleotide 



sequence 

sequence 

sequence 

sec[uence 

seqpience 

sequence 

sequence 

sequence 

sequence 

sequence 

sec[uence 

sequence 

sequence 

sequence 



of SEQ 
of SEQ 
of SEQ 
of SEQ 
Of SEQ 
Of SEQ 
of SEQ 
Of SEQ 
of SEQ 
Of SEQ 
of SEQ 
Of SEQ 
of SEQ 
Of SEQ 



ID MO:l 
ID NO: 3 
ID NO: 5 
ID NO: 6 
ID NO: 34 
ID NO: 35 
ID NO: 38 
ID N0:40 
ID NO: 42 
ID NO: 44 
ID NO: 46 
ID NO:47 
ID NO: 48 
ID NO: 65 



or 



36. A method for identifying a compoiind which modulates 
expression or activity of an HKNGl gene product comprising: 

(a) contacting a test compound to a cell that expresses 
an HKNGl gene product; 

(b) measuring a level of HKNGl gene product expression 
or activity in the cell; 

(c) comparing the level of HKNGl gene product 
expression or activity in the cell in the presence 
of the test compound to a level of HKNGl gene 
product expression or activity in the cell in the 
absence of the test compound. 
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Wherein if the level of HKNGl gene product expression or 
activity in the cell i„ the presence of the test compound 
dxffers from the level of HKNGl gene product expression or 
activity in the cell in the absence of the test compound, a 
compound that modulates expression or activity of an HKNGl 
gene product is identified. 

37. The method of claim 36, wherein the HKNGl gene 
product comprises: 

(a) the amino acid sequence of SEQ ID N0:2; 
the amino acid sequence of SEQ ID NO: 4; 
the amino acid sequence of SEQ id NO: 39; 



(to) 
(c) 



(d) the amino acid sequence of SEQ ID NO: 41; 

(e) the amino acid sequence of SEQ ID NO: 43; 

(f) the amino acid isequence of SEQ ID NO-45* 

(g) 

(h) 



15 



the amino acid sequence of SEQ id NO: 49; 
the amino acid sequence of SEQ ID NO: 51; or 



(i) the amino acid sequence of SEQ ID NO: 64, 



38. A method for identifying an individual having or at 
rxsk of developing a HKNGl-mediated disorder comprising the 
step of detecting the presence or absence of a polymorphism 
that correlates with an HKNGl allele associated with the 
disorder, wherein presence of the polymorphism indicates 
that the individual has or is at risk of developing the 
HKNGl-mediated disorder. 

39. The method of Claim 38, wherein the mutation 
results in production of a protein comprising an amino acid 
sequence that is different from the amino acid sequence of 
SEQ ID N0:2 or 4. 

40. The method of Claim 39, wherein the mutation 
results in the substitution of a lysine for a glutamic acid 

. at amino acid residue 202 of SEQ id NO: 2. 
30 
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41. The method of Claim 39, wherein the mutation 
results in the substitution of a lysine for a glutamic acid 
at amino acid residue 184 of SEQ ID NO: 4. 

42. The method of Claim 36, wherein the method 
comprises the step of analyzing the sequence of the coding 
region of the human HKNGl gene by preparing and sequencing 
cDNA comprising a sequence that hybridizes under stringent 
conditions to the complement of a nucleotide sequence which 
encodes the polypeptide sequence depicted in SEQ ID NO: 2. 
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CC£OntAaAeAAAA0GCACSAT»«>*"<«»M"*'» . 



«"«~*'»>-waiot « aciaT CT T uxat! ACTqAAA 

™^™O«»e«»»Q00OTTTCAC» 



FIG. 3M 



wo 99/47535 



18/57 



PCT/US99/05606 



»TAAACTATCATPAaTACACaC*XTXCTMTAAA«mrACCmO(WTm 
«CAAXTT««;CATTa:«»aC«TCTAAAC^ 



«^«-«AA^^ 

"TCTTTCTACOaUXaAAAATAtjrTXXrrTAATAATCC^ 

*^r«^u««A«^^ 

«^«««»«».««^^ 

*==»«™«°»*«iaacAAAoco^ 

»0CTACICMA«aT«OOCn«^ 

GrCAAATGTTAAmCACTATCTAKTTArrAATCaU^^^ 

~««cwmaAi»cAaacAA««^^ 

TTWaTTMCTTTT^ 

nCA«W«ms«CCCTAATC*AC«TA*AT^ 
CTTAfflUSUllAATtlMaATWCAtTBMAOO^^ 
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TCATACX:rTTAATC^AATAATA^rTT«;«:,^ 

«cceoa»»oc»e«mi i nniri< x« .c i^ ^^^ rnnnTiiimfi,. 

«TTTAACOTAGTTCA«GTCTTT*auuuvCAACM«^ 
«:TILTITCTGCTTAfi«GT™TAG«5CrMIU«a^ 

«:TCI«A»ATTM*CCrCTTATTJ«n«S^^ 
CGA(STTTCA«CaOCCT««TAima«!0^^ 

aacmCWmaULTAACTITTlOaTAJUaTG^^ 



AChOCMOCTCKTTAAAAA 

««»»TTCB«TCI«OCITTtaw,«:^ 
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OOCrrOCTCTJUiTTtC 
AiXArmAAOOMOkl 
AAAACAAAACTTGTTO 

^TCrjW»TTtCCIUXaTTATTTJU»TATTAJtt«OTec« 



oeci'itJLTCT&CTTcauciueTcauKaBaccci ^ 

WACCTCWAAAACATCTTTAACCTCATTAATi 



TAATAAAG 
rCATGAAA 
UXXATAA 
USATCTAG 



^ * ^^^^^^^^^^'^AGAGCCATAAAATTTCCrrr^ 



MATAOTTAATTC 
^ACMGATOOCAT 
LTATATCATGTAT 
aTATAATAATTD 
'ATTTAAATATTAi 

OGkOKVKuaua 

WaaCTAAAAlAAAAAAAATTAOOOtMaaTOWaXSS^^ 
«««**«ACAAG«««* 

^^^^^^^^^^^^^^^^^ 

^^^^^^^^^^^^^^^^^^^^^^ 



TCAAATGATiv 



iTAOGTTTTAGAGAAACGAAGOCrCA 
«««IU.WAC™»mo»««A«AACA««A«0^ 



««»=««CC««5(CTAClGACTAt^ 
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iA*wrATnnTATAATrPTTATGGTTTTTTATTAT^ 

^ — inTiTmoiljLi ^.„. 
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^'^ACTtauUVAAGTTCCAACATAClT^ 

c«=rrmcm«c„™^, =™«'"««==="tto««^^ 



rTAWauVTfflUSAAATTTTATCTCXTTACATTTTCTaXrTW^ 



TATACTACAAATTTCTJ 



TCCAATAAGTTAACGTATCCACTAATAATTATTA' 



TTTCT 



TCCTAGAACAAGACA(XyiTTAAGCATC\TCACarTCCC 



TCWWCACTATTCaiGCrPCT^^ 



kTCAOCSCCTCCTGC 



TATTOGGGGATX! 
TCCCCTGCTGCTT 



OCTATTTO«maUU«aUVATA*ACCAGCC^^ 
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AGTTGCGTCCCTCTCTGTTGCCAGGCTGGAGTTCAGTGGCATGTTCATAGCTC 

ACTGAAGCCTCAAATTCNTGGGTTCAAGTGACCCTCCTACCTCAGCCCCATGA 

GGACCTGGGACTACAGTTCCCTCCCTTTGGAACGCAGCGTGGGCACCTGCAA 

CGCAGAGACCACTGTATCTCCGGTGCAGAATGTAATGAGTGCCTGATACATT 

TGCCGAATAAACTATTCCAAGGGTTGAACTTGCTGGAAGCAANAGAAGCACT 

ATTCTGGTAACAGCGGGAACATGAAGCCGCCACTCTTGGTGTTTATTGTGTGT 

CTGCTGTGGTTGAAAGACAGTCACTGCGCACCCACTTGGAAGGACAAAACTG 
CTATCAGTGAAAACCTGAAGAGTTTTTCTGA 
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AGTTGCGTCCCTCTCTGTTGCCAGGCTGGAGrrCAGTGGCATGTTCTTAGCTC 

ACTGAAGCCTCAAATTCCTGGGTTCAAGTGACCCTCCCACCTCAGCCCCATGA 

GGACCTGGGACTACAGATGGAGTCTTGCTCTCGTTGCCCAGACTGGAGTGCA 

CTGCTGCGATCTCAGCTCACTGCAACCTCTACCTCCCAGGTTCAAGCGATTCT 

CCTGCCTCAGCCTCTCGAGTGGCTGGGACTATAGTAACAGCGGGAACATGAA 

GCCGCCACTCTTGGTGTTTATTGTGTGTCCGCTGTGGTTGAAAGACAGTCACT 

GCGCACCCACTTGGAAGGACAAAACTGCTATCAGTGAAAACCTGAAGAGnr 
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CTTGGAGTCAACTCZAGTGTQGA< 



lCTOAAAC I'lVCAAAAACTXIACATGAGGAGTCyvCTGGAGAATCATGATCAAGGACCrA 



79 



M K L 3 

CACaCTCTGACTTAACTTTATTCTtnGGACAATt^^ ATG AAG CTC 153 

Pl-^WFPVCLLWLKDCHCAPT 23 

CCA CTT TTG ATG TTT CCC GTG TCT CTG CTA TGG TTG AAA GAC TGT CAT TGT GCA CCT ACT 213 

*''^°*CTAIS BMANSPSEAGEI 43 

TGG AAG GAC AAA ACT GCC ATC AGT GAA AAC GCG AAC ACT TTT TCT GAG GCT GGG GAG ATA 273 

DVDGBVKIALICIK OMKIMM €3 

GAC CTA GAT GGA GAO GTG AAG ATA GCT TTG ATT CGC ATT AAA CAG ATG AAA ATC ATO ATG 333 

BRRBEBHSKLMKTLKKCKEE 83 

GAA AGO AGA GAG GAA GAA CAC AGC AAA CTA ATG AAA ACC TTG AAG AAG TCC AAA GAA GAA 393 

KQBALKLMNBVHEHLEBEBS 103 

AAO CAG GAG GCC CTG AAA CTT ATG AAT GAA GTT CAT GAA CAC CTG GAG GAG GAA GAA AGC 453 

^CQVSL ADSWDBCRA CLESN 123 

TTA TCC CAG GTT TCT CTG GCA GAT TCC TGG GAT GAA TGC AGG GCT TGC CTG GAA AGT AAC 513 

C-MRFDTTCQPAWSSVKNMVE 143 

TGC ATG AGG TTT GAT ACC ACC TGC CAA CCT CCA TGG TCC TCT GTG AAA AAT ATG GTG GAA 573 

OPPRKIYOPLPPLQENDRSG 

CAG TTT TTC AGG AAG ATC TAT CAG TTT CTG TTT CCT CTC CAG GAA AAT GAC AGA AGT GGC £33 



TBBDAQVSHIEHVF 



RRBPD QAPQ SYFTSGTDVT 



P P P S V S KB PAYRADAE P 



GAG CCT TTC TTT TTT CCA TCT TTG TCC AAG GAG CCA GCC TAC AGA GCA GAT GCT GAG 



183 



CCT OTC AGC AAA GGG GTC ACT GAG GAA GAT GCG CAG GTG TCA CAC ATA GAG CAT GTG TTC 693 

SQI'SADVTSLPNRSLYVFKQ 203 
AGC CAG CTG AGC GCA GAT GTG ACA TCT CTC TTC AAC AGA AGC CTT TAC CTC TTC AAA CAG 753 



223 



CTG CGG CCA GAA TTT GAC CAG GCT TTT CAG TCA TAT TTC ACA TOG GGG ACT GAC GTT ACA 813 



243 
CCA 873 



SHAZPNVFQLLCNZ. SFSVYQ 263 

AGC TGG GCC ATT CCC AAT GTC TTC CAG CTG CTC TGC AAC TTG AGT TTC TCH GTT TAT CAA 933 

S VS BKLITTLR ATEDPPKQD 283 

AGT OTC AGT GAA AAA CTC ATC ACA ACC CTG OGT GCC ACA GAG GAC CCT CCA AAA CAA GAC 993 

KDSHQQGPISKILPBQDRGS 303 

AAA GAC TCC AAC CAG GOA GGC CCS ATT TCA AAG ATA CTA CCT GAG CAA GAC AGA GGC TCA 1053 

DGKLGQNLSOCVN FRKRCQ K 323 

GAT GGG AAA CTT GGC CRG AAT TTG TCT GAT TGC GTT AAT TTT CGC AAG AGA TGC CAG iKAA 1113 

CQDYL SDDCPNVPELYRELN 343 

TGC CAG GAT TAT CTA TCT GAT GAC TGC CCT AAT GTG CCT GAA CTA TAC AGA GAA CTC AAT 1173 

BALRLVSRSHQQYDQV VQMT 363 

GAG GCC CTC CGA CTG GTC AGT AGA TCC AAT CAG CAA TAC GAC CAG GTC GTG CAG ATC ACC 1233 

QYHLBDTTLLMBKMRBQPGM 383 

CAG TAT CAC CTC GAA GAC ACC AOG CTT CTC ATC GAG AAG ATC AGA GAG CAG TTT GGC TGG 1293 

VSBLAYQSPOAEDIFNPVKV 403 

- GTT TCT GAA CTG GCA TAC CAG TCC CCA GGA GCT GAG GAC ATC TTT AAT CCA GTC AAA GTA 1353 
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MVALSAHEGNSSDonn*,, 
AW «A GCC CTA ACT GCT CAT OAA G«A «T TCT TCT ««.T cL. G^T G^C aL G^G C^r ,1" 

TCA AGC CIC CTO CCT TCC TCI AAC TTC ACA CTC AGC AGC CCT CTT GAA a1[g ,4 «C 103 

AAC GCr A^ T?C AW G^T C^C G^G g™ gL aL G^r C^T cL c!!c T^ aJ^ C^ T?T A" 

K T W • 
AAA ACT TGG TAA 

154S 

GAASATTTAGTCCATCCTATAATCAGCAAOAATTACACCTTCCCCCAAGACCTtSAOAATTCTGAAAATACAAAG^^^ 
TAACACAATGAACACAGCTGOlTGAAAGTTAGGTATATATTAGGAAGCACTATTGGlTTACTTTGTTGAATGGAAGr™ 1703 

AATA6CTATTCAAATTGA0TTAATATAAAAATITCTTCCTAAAAAGTAAAATCTACATATGTA6AATATt»TGCATTAG 1782 

TTCriTCTATACTAAATAAATACTCAOTCCCCT 



1815 
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63 
333 



CTTKKyVGTaJVCTCACnxnxWAC^^ 

M K L 3 

CACACTCT<aiCTTWlCTTTATTCT<^^ 

^^^"PPVCLLWLKDCHCAPT 23 

CCA CTT TTO ATG TTT CCC GTG TCT CTG CTA TGG TTG AAA GAC 213 

J!'^°'^^AISENANSFSEAGEI 43 

TGG AAG GAG AAA ACT GCC ATC AGT GAA AAC GCG MC ACT TTT TCT GAG GCT GCX; GAG AT^ 273 

'^^°®SVKIAI.IGIKQMKIMM 
GAC GTA GAT GGA GAG GTG AAG ATA GCr TO ATT GGC ATT AAA CAG ATO AAA ATC ATO ATG 

"'*^^®^«SKI'MKTLKKCKEE 83 

CAAAGOAGAGAGGAAGAACACAGCAAACTAATGAAAACCTTGAAGAAGTCCAAACAAGAA 393 

AAO CAO GAG GCC era AAA err ATO AAT GAA GTT CAT GAA CAC CTG GAG GAG GAA GAA AGC 453 

LCQVSLAOSHDBCRACLES N 123 

TTA TGC CAG GTT TCT CTG 6CA GAT TCC TGG GAT GAA TCC AGG GOT TGC Crc GAA AGT AAC S13 

^^■"^'''TTCQPAWSSVKNMEN 145 

TGC ATO AGO TTT GAT ACC ACC TGC CAA CCT GCA TGG TCC TCT GTG AAA AAT ATC GAA AAT 573 

DRSGP VS KGVTEEDAQVSHI 163 

GAC AGA AGT GGC CCT GTC AGC AAA GGG GTC ACT GAG GAA GAT GCG CAG GTO TCA CAC ATA 633 

EHVFSQLSADVTSLPNRSLy 183 

GAG CAT GTG TTC AGC CAG CTO AGC GCA GAT GTG ACA TCT CrC TTC AAC AGA AGC CTT TAC 693 

VFKQ LRREPDQAFOSYPTSG 203 

GTC TTC AAA CAG CTO CGG CGA GAA TTT GAC CAC GCT TTT CAG TCA TAT TTC ACA TCG GGG 753 

TDVTEPFFPPSLSKEPAYRA 223 

ACTCaCCTTACAGAGCCTTTCTTTTTTCCATCTTOTCCAAGGAGCCAGCCTACAGAGCA 813 

DAEPSWAIPMV FQLLCMLSP 243 

GAT GCT GAG CCA AGC TGG GCC Arr CCC AAT GTC TTC CAG CTO CTC TGC AAC TTG AGT TTC 373 

Jn. Zr '^VSBKI'ITT LRATEDP 263 

TCAGTTTATCRAAGTGTCAGTGAAAAACTCATCACAACCCroCGTGCCACAGAGGACCCT 933 

PKQDKDSNQGGPZSKXLPBQ 283 

OCAAAACAAGACAAAGACTCCAACCAGGGAGGCCCOATTTCAAAGATACTACCTaAOCAA 993 

0«GSDOKLGQNLSDCVN FRK 303 

GACAGAGGCTCAGATGGOAAACTTGGCCAGAATTOTCraATTGCGTTAATTTTCGCAAG 1053 

^^S^^^^^^^^^^^CVKVP EL Y 323 

AfiATOCCAOAAATGCCAOaATTATCTATCTGATGACTGCCCTAATGTCCCTGAACTATAC 1113 

RBLHEALRLVSRSNQQYDQV 343 

AOAaAACrCAATGAGGCCCTCCGACTCGTCAGTAGATCCAATCAGCAATACGACCAOGTC 1173 

Z.^?«.1''°^"''^°^^^^»««KMRE 363 

GTOCAGATQACCCAGTATCACCTCGAAGACACCAOGCrrCTCATCGAGAAGATOAGAGAO 1233 



OPG HVSELAYQSP 



<5 A B D I F N 383 

1293 



CACTrrGGCTGGGTTTCTGAACTCGCATACCAGTCCCCAGGAGCTGAGGACATCTTrAAT 

r^JL/^"''^''^^«^°NSS«>QDO 403 
CCA GTO AAA OtA ATO GTA CCC CTA AGT CCT CAT GAA GGA AAT TCr TCT GAT CAA GAT GAC 1353 



FIG. 8A 



wo 99/47535 



32/57 



PCT/US99/05606 



aL G^G G^r t^T tL TCT A^C 4 AGC C^T gL ll" 

AAG AGT GCT GGC AAC GCT AAC TTC ATT GAT CAC GTO GTA GAG AAG GTT err cic c!Ic m 1473 

451 
1497 



KBHFKTW* 
AAG GAG CAC TTT AAA ACT TGG TAA 



GAAGATTTAGTCCATCCTATAATCAGOUUSAATTACACCTTCGGCCAAGACCTGAGAATTC^^ lS7j 

TAACACAATGAACAaUSCTGCAttaUUCTTAGGTATATATTAGGAAGCACTATTGGTTTACTT^^^ 

AATAGCTATTCAAATTGAGTTAATATAAAAATrrCTTOTAAAAAGTAAAATGTACATATGTAGAATATGATG^^ 
TTCTTTGTATACTAAATAAATACTGAGTCCCCT 



1655 
1734 
1767 
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CTTGGACTCAACTGAGTGTCGACTGAAACTICCAAAAACTGAaVT^ 7 9 



H K L 3 

CACACTCTGACTTAACTTTATTCTGTGCSACAATGAGAGACAACTGCAAGGATrAA ATO AAO CTG 153 

PLLHPPVCLLWLKDCKCAPT 23 

CCA CTT TTG ATG TXT CCC GTG TCT CTG CTA TGG TTG AAA GAC TGT CAT TGT GCA CCT ACT 2X3 

WKDKTAISENANSPSBAGBI 43 

TCG AAG GAC AAA ACT GCC ATC ACT GAA AAC GCG AAC AGT TTT TCT GAG CCT GGG GAG ATA 273 

DVDGEVKIALIGIKQMKIMM 63 

GAC GTA GAT GGA GAG GTG AAG ATA OCT TTG ATT GGC ATT AAA CAG ATG AAA ATC ATG ATG 333 

BRREBEHSKLMKTLKKCKBB 83 

GAA AG6 AGA GAG GAA GAA CAC AGC AAA CTA ATG AAA ACC TTG AAG AAG TGC AAA GAA GAA 393 

KQBAZ«KLMNBVHB H LBBEES 103 

AAG CAO GAO GCC CTG AAA CTT ATG AAT GAA GTT CAT GAA CAC CTG GAG GAG GAA GAA AGC 453 

LCQVSLAOSHDECRACLESH 123 

TTA TGC CAO GTT TCT CTG GCA GAT TCC TGG GAT GAA TGC AGG GCT TGC CTG GAA AGT AAC 513 

CMRFDTTCQ, PAHSSVKHHBP 143 

TGC ATG AGG TTT GAT ACC ACC TGC CAA CCT GCA TGG TCC TCT GTG AAA AAT ATG GAG CCA 573 

AYR ADABPSWAIPNVPQLLC 1€3 

GCC TAC AGA GCA GAT GCT GAG CCA AGC TGG GCC ATT CCC AAT GTC TTC CAG CTG CTC IGC 633 

NLSPSVYQSVSEKL2TTLRA 183 

AAC TTG ACT TTC TCA GTT TAT CAA AGT GTC AGT GAA AAA CTC ATC ACA ACC CTG CGT GCC 693 

TEDPPKQDKD5NQ6GPZSKI 203 

ACA GAO GAC CCT CCA AAA CAA GAC AAA GAC TCC AAC CAG GGA GGC COG ATT TCA AAG ATA 753 

X*PBQDRGSDGKLGONX«SDCV 223 

CTA CCT GAG CAA GAC AGA GGC TCA GAT GGG AAA CTT GGC CAG AAT TTG TCT GAT TGC GTT 813 

HFRKRCQKCQDYLS DDCPHV 343 

AAT TTT CGC AAG AGA TGC CAG AAA TGC CAG GAT TAT CTA TCT GAT GAC TGC CCT AAT GTG B73 

PBLYRELHBALRLVSRSKQQ 263 

CCT GAA CTA TAC AGA GAA CTC AAT GAG GCC CTC OGA CTG GTC AGT AOA TCC AAT CAG GAA 933 

YDQVVQ MTQYR LBD'TTLLMB 283 

TAC GAC CAO GTG GTG CAG AXO ACC CAO TAT CAC CTG GAA CAC ACC ACQ CTT CTG ATG GAG 993 

KNRBQPQHVSELAYQSPGAB 303 

AAO ATG AGA GAG CAO TTT GGC TGG GTT TCT GAA CTG GCA TAC CAG TCC CCA GGA GCT GAG 1053 

DZFHPVKVM VALSARBGNSS 323 

GAC ATC TTT AAT CCA OTO AAA GTA ATG GTA GCC CTA ACT OCT CAT GAA GGA AAT TCT TCT 1113 



DQCDTVVPSSLLPSSHPTLS 343 

GAT CAA GAT GAC ACA OTO GTT OCT TCA AGC CTC CTG CCT TCC TCT AAC TTC ACA CTC AGC 1173 

SP I#EKSAGHAHP1DHVVBKV 363 

AGC CCT CTT GAA AAO AGT GCT GGC AAC GCT AAC TTC ATT GAT CAC GTG GTA GAG AAG GTT 1233 



LQHFKBHFKTH* 375 
CTT CAG CAC TTT AAO GAG CAC TTT AAA ACT TGG TAA 1269 

GAAQATTTAGTCCATCCTATAATCAOCAAGAATTACACCTTOGGCCAAGACCTGAGAATTCTC^^ 1 3 4 S 
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TAA<»CAATCAACACAGCTG<»TGAAAGTTA«STATATATTAOGAAOCACTATTC(^ 

AATAGCTATTCRAATT(H«m-AATATAAAAATTTCTTCCTAAAAACTAAA^ 

TTCTTTXJTATACTAAATAAATACTCAGTCCCCT 
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CTTOGAGTCAACTGAGTGTGGACTGAAACTTCCAAAAAC^^ 79 

M K L 3 

CACACTCTCaiCrTAMnTTATTCTGTGGACAATGWIAG ATG AAO CTC IS 3 



PLLMPPVCLLWLKDCHCAPT 23 

CCA CTT TTG ATG TIT CCC GTG TGT CTG CTA TGG TTG AAA GAG TGT CAT TGT 6CA CCt ACT 213 

WKDKTAISEMANS FSBAGEI 43 

TGG AAG GAC AAA ACT GCC ATC ACT GAA AAC GCG AAC AGT TTT TCT GAG GCT 6GG GAG ATA 273 

DVDGEVKZALIGIKQMKIMM 63 

GAC GTA GAT GGA GAG GTG AAG ATA GCT TTG ATT GGC ATT AAA CAG ATG AAA ATC ATG ATG 333 

BRRBBBHSKLMKTLKKCKEE 83 

GAA AGG AGA GAG GAA GAA CAC AGC AAA CTA ATG AAA ACC TTG AAG AAG TGC AAA GAA GAA 393 

KQBALKLMNBVHEKLBBEBS 103 

AAO CAG GAG GCC CTG AAA CTT ATG AAT GAA GTT CAT GAA CAC CTG GAG GAG GAA GAA AGC 453 

LCQVSLADSWDECRACLESK 123 

TTA TGC CAG GTT TCT CTG GCA GAT TCC TGG GAT GAA TGC AGG GCT TGC CTG GAA ACT AAC 513 

CM RPOTTCQPAWSSV KNMPA 143 

TGC ATG AGG TTT GAT ACC ACC TGC CAA CCT GCA TGG TCC TCT GTG AAA AAT ATG CCA GCC 573 

YRADAEPSWAIPNVFQLLCH 1S3 

TAG AGA GCA GAT GCT GAG CCA AGC TGG GCC ATT CCC AAT GTC TTC CAG CTG CTC TGC AAC 633 

LSF SVYQS VSBKLITTLRAT 183 

TTG ACT TTC TCA GTT TAT CAA AGT GTC AGT GAA AAA CTC ATC ACA ACC CTG CGT GCC ACA 693 

BDPPKQDKDS NQGGPZ SKZL 203 

GAG GAC CCT CCA AAA CAA GAC AAA GAC TCC AAC CAG GGA GGC CCG ATT TCA AAG ATA CTA 753 

PBQDRGSDGKItGQNLSDCVH 223 

CCT GAG CAA GAC AGA GGC TCA GAT GGG AAA CTT GGC CAG AAT TTG TCT GAT TGC GTT AAT fil3 

PRKRCQKCQDYLSDDCPHVP 243 

TTT OGC AAG AGA TGC CAG AAA TGC CAG GAT TAT CTA TCT GAT GAC TGC CCT AAT GTG CCT 673 



SLYRBLKBALRLVSRSNQQy' 263 

GAA CTA TAC AGA GAA CTC AAT GAG GCC CTC CGA CTG GTC AGT AGA TCC AAT CAG CAA TAC 933 

DQVVQHTQYBLBDTTLI.KB K 283 

GAC CAG GTG GTG CAG ATG ACC CAG TAT CAC CTG GAA GAC ACC ACQ CTT CTG ATG OAS AAG 993 

MRBOPGWVSBLAYQSPGAED 303 

ATG AGA GAG CAG TTT GGC TGG GTT TCT GAA CTG GCA TAC CAG TCC GCA GGA GCT GAG GAC 1053 

ZPH .PVKVMVALSAHBGHSSD 323 

ATC TTT AAT CCA GTG AAA GTA ATG GTA GCC CTA AGT GCT CAT GAA GOA AAT TCT TCT GAT 1113 

QDDTVVPSSLLPSSMPTLSS 343 

CAA OAT GAC ACA GTG GTT CCT TCA AGC CTC CTG CCT TCC TCT AAC TTC ACA CTC AGC ASC 1173 



PLEKSAGNA HPIORVVBKVL 363 
CCT CTT GAA AAG AGT GCT GGC AAC GCT AAC TTC ATT GAT CAC GTG GTA GAG AAG GTT CTT 1233 



QHPKBHPKTH* 374 
CAG CAC TTT AAG GAG CAC TTT AAA ACT TGG TAA 126G 

GAAGAl i 1 AU TC CA ltXT A TAATCAGCAAGAATTACA LC ri Ui GCCAAGACCTGAGAATTGTGAAAATACAAAOCACGC 1345 
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TAACACAATGAACACAGCTXXaVTGAAAGTTAW ^^^4 
AATAGCTATTCkAATTtyunTAATATAJUVAATTTC^ ^^^^ 
TTCTTTGTATACTAAATAAATACTCAGTCCCCT ^^^^ 
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GCAACCTCGTTOmyWSAGCCTGaw;^ ATG aL era CCA CTC TTC GTO 69 

IVYLLRLRDCQCAPTGKDRT 28 

ATT GTG TAT CTC Cro CGG CTO AGA QAC TCT GAG TGT GCG CCT ACA GGG AAG GAC CC^ 129 

^^"^B.DPK GFSKAGBIDVDEE 48 

TCC ATC CGT GAA GAC CCG AAA GGT TIT TCC AAG GCT GGG GAG ATA GAC GTA GAT GAA GAG X89 

^'^•^A^IGMKQM.KILMERREE €8 

GTG AAG AAG GCT TTG ATT GGC ATG AAG CAC ATG AAA ATC CIO ATG GAA AGA AGA GAO GAG 249 

^"S'^LMRTLICKCRBEKQEAL 88 

GAA CAT AGC AAA CTA ATG AGA ACA CTO AAO AAA TCC AGA GAA GAA AAG GAG GAG GCC CTG 309 

KLMNEVQBHLEEEE RLCQVS 108 

AAG CTT ATG AAT GAA GTT CAA GAA CAT CTA GAA GAG GAA GAA AGO CTA TCC CAQ GTG TCT 369 

I-HGSWD BCKSCLESDCMRPY 128 

ClOATOGGTTCCTGaGACGAATGCAAATCrWKrCTGGAAAGTGACTGCATOAaATTTTAT 429 



8SMKSTIERV PRK 



HEDDEKELPVGE 



148 



ACAACCT6CCAAAGCAGTTGGTCCTCTATGAAATCCACGATTQAACGGGTTTTCCGGAAG 489 



_ _ . . - „ 168 

ATA TAT CAG TTT CTC TTT CCT TTC CAT GAA GAC GATGAAAAAGA0CTTCCTGTT6GTGAG S49 

KPTEBDVQLMOIBNVFSQI.T 188 

AAO TTC ACT GAO GAA GAT GTA CAG CTO ATO CAG ATA GAG AAT GTG TTC AGC CAO CTO ACC 609 

VDVGFLYNMSFHVFKQMQQB 208 

GTG CAT GTG GGA TTT CTC TAT AAC ATG AGC TTT CAC GTC TTC AAA CAG ATG CAG CAA GAA 669 

PDI-AFQS YFMSDTDSMEPYP 228 

TTTGACCTCGCTTTTCAATCATACTTTATGTCAGACACAGACTCCATGGAGCCTTACTTT 729 

P^AJ'SKEPAKKAHPMQSWDI 248 

TTT CCA GCT TTT TCC AAA GAG CCA GCA AAA AAA GCA CAT CCT ATG CAG AGT TGQ GAC ATT 789 

P SFPQLFCNFSLSV VQSVSA 266 

CCC AGC TTC TTC CAG CTG TTT TOT AAT TTC AGC CTC TCT GTT TAT CAA AOT 6TC AGC GCA 849 

^V^BMI-KAIBDLSKQDKDSA 286 

ACA CTT ACA GAOATGCrOAAOGCCATTGAOGACTTATCCAAACAAGAC AAA OAT TCT GCC 909 

«<'<3PSfiTTHP VROROI-COBP 308 

CAC CGT GGA COO AGT TCC AGO AOO TGG CCT 6T0 OGQ GGC AGA GGS CTO TGT GGA GAA OCT 969 

GQMSSBCLQPHAR CQKCQD Y 328 

GGC CAG AAC TCG TCC GAA TGT CTC CAA TTT CAT GCA AGA TGC CAG AAA TGT CAG OAT.TAC 1029 

^WADCPAVPBLYTKADBAL-E 348 

CTA TGG GCA GAC TGC CCT CCT GTT CCT GAA CTA TAG ACA AAO GCG GAT GAO GCC CTT GAG 1089 

I-VKISHQQYAQVLQMTQHHL 368 

TTG OTC AAC ATA TCC AAT CAO CAG TAT GCC CAC GTA CTC CAO ATG ACC CAO GAT CAC TTO 1149 

EDTTYLM EKMREQPOWVTBL 388 

OAO GAC ACC AOO TAT CTG ATG GAG AAO ATG AGA GAG CAG TTT GGT TGG GTA ACA GAG CTO 1209 

ASQTPGSBNIFSFIKVVPGV 408 

CCC AGC CAG ACC CCA GGA AGC GAG AAC ATC TTC AGT TTC ATA AAG GTA GTT CCA GGT GTT 1269 

"BONPSKQDEKMIDISILPS 428 
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CAC GAA CGA AAT TTC TCC AAA CAA GAT OAA AAG ATO ATA GAC ATA AGC ATT CM CCT TCC 1329 

^"''■'''■'IPI-BBSAESSDFls 
TCT AAT TTC ACA CTC ACC ATC CCT err GAA GAA AGT GCT GAG AGT TCC GAC TTC ATT ACC iVbI 

TAC ATO CTO GCC AAA OCT CTA CAO CAT TTT AAO GAA CAT TTT AAA TCT TGG TAA i"* 

OCAGAGTATrreArrAGGGACGTTTGCTGATAGGAATAGATGGTTCTTAAAAGGGAAAAATOKauuUC^ 1S22 

ATACCrTGAAAAa^TATTCAACCTCATTAATAATCAAAGGCATGAAAACTAAGACAAGTTAGCAGTTTTTACCrATTGA 1601 

ATrrrCAAATTAAAAAAAAAAATCCTOATAGAATTOUVTOAAATGACAATTCTTATATGTQATTGCCSfi^^ 1S80 

GTTTTGTCrrTTTGAAAAGTTATTCAATTATACATATCAAGAGTCATCAAATTTCTTTTTAATATAATAATTCCACTTC 1759 

TaaATCAATCCAAAGGAGTAAATCTAAAATTGAATTGAAGTTCCCACCCCAAGATCAATATTTGCRAATTATTTA^ 183B 

TACTAAACTOTTAAAAACTGAATGTaVTCTGAATtmrnttAAACCAGAAAieGTTAAAAGCIOTOGCTAAATATC^ 1917 

AAAaATCTTATAAAACCATTAAAAAtAXTTATAAAATTTAAATCATQACATGACATCTCCXWSAACAAGAGTT^ 1996 

AAGCCrATCTATAAGGCRAATATTATTATTACTATCTTCCAGAAAAGAAACTTGAflACTCAGGGTCCAAGTOTT^ 2075 

^^^^ 

AATACTGGGGCAGGTTGCTATITCCTTCrCCAGGAAATCTTCCCTATCCAGGGATGaAACCCAaWTCCT^ 2J33 
GGTAOATGCTTTACTATCTSACCRACCAAATGAAITACTCAAGTCAGTAGGGGGTAaAGGCAAATTTTAACTT^^ 2312 
CTCIGAATCATAATTGCCaCATTRAACTGGTTCCroTTGGGRCATTTGGTIGAAAAAAATAAAGTG^^ 23S1 
AAACTCTATAAATGTAATGATCAAAACGAAAAAAAATCTACAATCTGCATTAAAAATAAAAAGGaTTGGCAGG 
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M K P P L 5 

CACAAGCWXnXXSCAACCTCGTTGGTGAGAGCXrrGCACnTAGTC^^ ATG AAG CCG CCA CTC 72 



FIVYLLRLRDCQCAPTGK 



IREDPKGFSKAGBIDV 



EEEHSKLM RTLKKCRBEKO 



25 



TTG GTG TTT ATT GTC TAT CTG CTO CGG CTG AGA GAC TGT CAG TGT GCG CCT ACA GGG AAG 132 



45 



GAC CGA ACT TCC ATC CGT GAA GAC CCG AAG GGT TTT TCC AAG GCT GGG GAG ATA GAC 6TA 192 

OEEVKKALIGM KQ MKILMER 65 
GAT GAA GAG GTG AAG AAG GCT TTG ATT GGC ATG AAG CAG ATG AAA ATC CTG ATG GAA ACA 252 



85 



ACA GAG GAG GAA CAT AGC AAA CPA ATG AGA ACA CTG AAG AAA T6C AGA GAA GAA AAG CAG 312 

BALK LMNBVQBHLBBBSRLC 105 

GAG GCC CTG AAG CTT ATG AAT GAA GTT CAA GAA CAT CTA GAA GAG GAA GAA AGG CTA TXKT 372 

QVSLNGSHDBCKSCLBSDCH 125 

CAG GTG TCT CTO ATG GGT TCC TGO GAC GAA TGC AAA TCT TGC CTG GAA ACT GAC TGC ATG 432 

'^P'^TTCQSSWSSMKSTIERV 145 

AGA TTT TAT ACA ACC TGC CAA AGC ACT TGG TCC TCT ATG AAA TCC AOG ATT GAA CGG GTT 492 

f'RKIYQFLFPFHEDDEKELP 165 

TTC CGG AAG ATA TAT CAG TTT CTC TTT CCT TTC CAT GAA GAC GAT GAA AAA GAG CTT CCT 552 

VGEKFTEEDVQLMQ IENVPS IBS 

CTT GGT CAO AAG TTC ACT GAG GAA GAT GTA CAG CTG ATG CAG ATA GAG AAT GTG TTC AGC 612 

QLTVDVGFI.yNMSPHVPKQM 205 

CAG CTG ACC CTG GAT GTG GGA TTT CTC TAT AAC ATG ACC TTT CAC GTC TTC AAA CAO ATG 672 

QQBF OLAFQS YFHSDTDSHE 225 

CAO CAA GAA TTT GAC CTG GCT TTT CAA TCA TAG TTT ATG TCA GAC ACA GAC TCC ATG GAG 732 

P^P^'PAPSKEPAKKAHPMQS 245 

CCT TAC TTT TTT CCA GCT TTT TCC AAA GAG CCA GCA AAA AAA CCA CAT CCT ATG CAG ACT 792 

HDXP SFPQLPCNPSLSVYQS 265 

TGG GAC ATT CCC AGC TTC TTC CAG CTG TTT TGT AAT TTC AGC CTC TCT GTT TAT CAA ACT 852 

V S A TVTBMLKAIBDLSKQDK 285 

GTC AGC GCA ACA GTT ACA GAG ATG CTG AAG GCC ATT GAG GAC TTA TCC AAA CRA GAC AAA 912 

DSABOGPSSTTHPVR'ORGLC 305 

eUVr TCT CCC CRC cot GOA coo act TCC ACO acq TGO CCT GTG CGG GGC AGA GGG CTC TGT 972 

^BPOQHSSBCLQPHARCQK.C 325 

OQA GAA CCT GGC CAO AAC TOO TCC GAA TGT CTC CAA TTT CAT CCA AGA TGC CAO AAA IGT 1032 

QDY LHADCPAVPBLYTKADB 345 

CAO GAT TAC CTA TGO CCA GAC TGC CCT GCT GTT CCT GAA CTA TAC ACA AAC GCO OAT GAG 1092 

ALBLVHISHQQYAQVLQMTQ 365 

GCC CTT GAG TTC CTC AAC ATA TCC AAT CAG CAG TAT GCC CAG GTA CTC CAO ATC ACC CAG 11S2 

UHLBDTTYLHBKHRBQFOHV 305 

CAT CAC TTO GAG OAC ACC ACG TAT CTO ATG GAG AAO ATG AOA CAO CAO TTT GOT TOO OTA 1212 

TELASQTPGSE HIFSPIKVV 405 

ACA GAG CTG GCC AGC CAG ACC CCA GGA AGC GAG AAC ATC TTC AGT TTC ATA AAG GTA GTT 1272 
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L^Z..L±± .^.Z_J K O D K K H I o I S I «5 



CCA GGT GTT CAC CAA GGA AAT TTC TCC AAA CAA GAT OAA AAG ATX, ATA GAC ATA A^C A^T x\ll 

cJis C^ T^ TCT A^T TO A^A C^C aL ATC C^ c5^ gL gL A^ gL A^ OAC x\ll 

^^^^""•^KAVQHPKEHPKQu 

TTC AIT AGC TAC ATG CTG GCC AAA GOT GTA CAG CAT TIT AAO GAA CAT TTT AAA TCT TGG ll" 



TAA *66 

14SS 

CCAGAGTATTTGATTAGGGACGTTTGCIX»TAGGAATAGATGGTTCTTAAAAGGGAAAAATCACAAAACTAGCTTT^ tS3A 

ATACCTTGAAAACGTATTCAACCTCATTAATAATOUUUKKZATGAAAACTAAGACAACTrAGaVGTTTTTACCTATT^ 1613 

ATTTTCAAATTAAAAAAAAAAATCCTGATAGAATGCAATGAAATGAGAATTCTTATATCn^TTCCCflGAAAaiAACTC 1692 

GTTTTOTCTTTTTGAAAAGTTATTCAATTATACATATCAAGAGTCATCAAATTTCrTTTTAATATAATAATTCCACTTC 1771 

TGGAATCAATCCAAAaSAGTAAATCTAAAATTGAATTGAAGTTCCCACCCCAACATCAArATTTCCAAA laso 

TAOTAAACTGTTAAAAACTGAATffTCATCItJAATGTCIM^^ ^^^S 

*AATATCTTATAAAA0aiTTAAAAATRTTTA3aAAATrTAAATCATX3tfaTCACATCT« 2008 

AAGCCTATCTATAAGCCAAATATrATTATTACTATCTTCaVGAAAAGAAACTTGA(»CTCAGGGTCCAACTCTT^ 2087 
CICAGTCATOTCIGACTCTTTGAGACCCCTTGGACTtmXKXCACCAGGCTCCTCTGTCCAXG 

AATWmWAGCAGGTTCCTATTTCCTTCTCCAGOAAArCTTCCCIATCCAGGCaTGGAACCC^ 224S 

GGTAGATGCTTtActATCTGAGCAACCAAATGAATTACTCAAGTCAGTACGGGGTAGAGGCRAAITTTAAOT 2324 

CTCKa«TCATAATTGCCACATTAAACIGGTTCCTGTIGGGACRTTTGGTTGAAAAAAATAAAGTGAAAAAlta^ 2403 

AAACrCTATAAATCTAATOATCAAAACGAAAAAAAATCTACAATCTGCATTAAAAATAAAAACGGTT*^^ 2482 

SgrrGGAAATGBATqATTTTTTTTA A CCli 1 l U lXLil Uia TATTTTACAArrTICTATAATaAATAAAIAATTTTGA 2561 

GATTTCAAATTAGAAGATATGTTGCrAAAATAGCTAGGTAAAXGTAGATTGAACRCTGiaTCAATGTa^^ 2640 

2,„ 

ACaUUaaAATGACATTTCAGAOGTACATTACCAlCTCTOTTAOaATAATCTTCIGAATTaA 2798 

ACaaaOTAITCTCCTTrGGTAAAATGtmSUVTCrrAAAGAAGCATTAAATOTTAATTCPAaO^^ 2877 ■ 

a.iiuijuiuiiMiGim.TArcAimnMi^ ^ ^^^^ 

"WaaWWTTOTTaOTCATCAAlCAACCA&TATCTTTTIAEXaTCTTCI^^ 
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N K 2 

CTGAACGTCXTTACAGAAGCTtKnGGCAACCTXXnTGGT^ ATG AAG 7S 

PPILVPIVYLLQLRDCQCAP 22 

CCG CCA ATC TTG GTG TTT ATC GTG TAT CTG CTG CAG CTG AGA GAC TGT CAG TCT GCG CCT 136 

TGKDRTSIREDPKGFSKAGE 42 

ACA GGG AAG GAC CGA ACT TCC ATC CCT GAA GAC CCG AAG GGT TTT TCC AAG GCT GGG GAG 196 

lOVOBBVKKALIGMKQMKIL £2 

ATA GAC GTA GAT GAA GAS GTG AAG AAG GCT TTG ATT GGC ATG AAG CAG ATG AAA ATC CTG 256 

MBRRBBBH5 KLMRTLKKCRB 62 

AtX3 GAA AGA AGA GAG GAG GAA CAT AGC AAA CTA ATG AGA ACC CTG AAG AAA TGC AGA GAA 316 



KKQEALKLMNE 
GAA AAG CAG GAG GCC CTG AAG CTT ATG AAT GAA 

RLC0VSLM6SN 
AGG CTA TGC CAG GTG TCT CTQ ATG GGT TCC TGG 



V QBH LEBBC 102 

GTT CAA GAA CAT CTA GAA GAG GAA GAA 376 

OECKSCLBS 122 

GAC GAA TGC AAA TCT TGC CTG GAA AGT 436 



D CMRPVTTCQSSWSSMKSTI 142 

GAC TGC ATG AGA TTT TAT ACA ACC TGC CAA AGC AGT TGG TCC TCT ATG AAA TCC ACG ATT 496 

BRVFRKIYOFX'F PFHEDOBK 162 

GAA CCG GTT TTC CGG AAG ATA TAT CAG TTT CTC TTT CCT TTC CAT GAA GAC GAT GAA AAA 556 

BLPVGEKPTEE DVQLMQIEN 182 

GAG CTT CCT GTT GGT GAG AAG TTC ACT GAG GAA GAT GTA CAG CTG ATG CAG ATA GAG AAT 616 

VFSQLTVOVGFLYNHS PHVF 202 

GTG TTC ACC CAG CTG ACC GTG GAC GTG GGA TTT CTC TAT AAC ATG AGC TTT CAC GTC TTC 676 

KQHQQBPDLAPQSYPMSDTD 222 

AAA CAG ATG CAG CAA GAA TTT GAC CTG GCT TTT CAA TCA TAC TTT ATG TCA GAC ACA GAC 736 

SNBPYPFPAFSKEPAKKAHP 242 

TCC ATG GAG CCT TAC TTT TTT CCA GCT TTT TCC AAA GAG CCA GCA AAA AAA GCA CAT CCT 796 

HQSHOXPSPFQI«FCNPSXiSV 262 

ATG CAG AGT TGG GAC ATT CCC AGC TTC TTC CAG CTG TTT TOT AAT TTC AGC CTC TCT GTT 856 

YQSVSAT VTBHLK AIEDLSK 262 

TAT CAA AGT OTC AGC GCA ACA GTT ACA GAG ATG CTG AAG GCC ATT GAG GAC TTA TCC AAA 9X6 

Q D K D S A H G G -P-^S S T T H P V R G.R ..._A0?.. 

CAA GAC AAA GAT TCT GCC CAC GGT GGA COG AGT TCC ACG ACG TGG CCT GTG CGG GGC AGA 976 

a LCGBPOQNSSBCLQPHARC 322 

OGG era TGT GGA GAA CCT GGC CAG AAC TCG TCC GAA TGT CTC CAA TTT CAT GCA AGA-TGC 1036 

QKCQDYI.WADCPAVPBLYT'k 342 

CAG AAA TGT CAO GAT TAC CTA TGG GCA GAC TGC CCT GCT GTT CCT GAA CTA TAC ACA AAG 1096 



ADEALBI«VHISNQQYAQVLQ 362 

GCG GAT GAG GCC CTT GAG TTG GTC AAC ATA TCC AAT CAG CAG TAT GCC CAG GTA CTC CAG 1156 

HTQHHLBDTTYLMEKMRE QF 382 

ATG ACC CAG CAT CAC TTG GAG GAC ACC ACG TAT CTG ATG GAG AAG ATG AGA GAG CAG TTT 1216 

GWVTBLASQTPG SBNI FSFI 402 

GGT TGG CTA ACA GAG CTG GCC AGC CAG ACC CCA GGA AGC GAG AAC ATC TTC AGT TTC ATA 1276 

KVV PGV H B O N P S KQ D B K M X D 422 
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AAO GTA GTT OCA CGT CTT CAC GAA GGA AAT TTC TCC AAA CAA OAT GAA AAG ATC ATA GAC 1336 

^^^^P^^NPTLTIPL BESAE 442 
ATA AGO Arr CTO OCT TCC TCT AAT TTC AOl CTC ACC ATC CCT CTT GAA CAA ACT OCT GAG 1396 

^^'^^'^ ^^"I'AKAVQHPKBHP 462 
ACT TCC OAC TTC ATT AGC TAC ATC CTG GCC AAA GCT GTA GAG CAT TTT AAG GAA CAT TTT 1456 

K S If * 
AAA TCT TGG TAA 

1468 

GCAGAGTATTTGATTAGGGACGTTTGCTGATAGGAATAGATGGTTCTTAAAAGGGAAAAATGACAAA^ I547 
ATACCTTGAAAACGTATTCAACCTCRTTAATAATCAAAGGCATGAAAACTAAGACAACrTAGOW^^ 1626 
ATTTTCAAATTAAAAAAAAAATCCTGATAGAATt^TCWlATaAGAATTCTTATAT^ 

1 lllblLl 1 niUAAAAGTTArrCAATTATACATATCAAGAGTCATCAAATTTCTTTTTAATAT 1734 
GGAATCAATCCAAAGGAGTAAATCTAAAATTGAATTGAACTTCCCACCCCAAGATCAATATrr^^ 1863 

AATATCTTATAAAACCATXAAAAATATTTATAAAATTTAAATCATGACATGAC^ 2021 

AGCCTATCTATAAGGOUlATATTATTATrACTATCTTCCAGAAAAGAAACTTGAGACTCACG^ 2100 

TeAGTCATGTCIGACTCTTTaAGACCOCrrGaACTGTAGCCCACCAGGCTCCTCTGTC^ 2173 

ATACTGGAOCAGGTTGCTATTTCCTTCrCCAGGAAATCTTCCCTATCCAGGGATGGAACCCAGGTCrC^ 2258 

CTAGATGCTTTACTATCTGACCAACCAAATGAATTACTCAAGTCAGTACGGGGTAGAGGCAA^ 2337 

TCTGAATCATAATTGCCACRrrAAACTGGTlCCTGTTGGGACATTTGGTTGAAAAAAATAAAGTG^ 2416 
AACTCTATAAATGTRATGATCAAAACGAAAAAAAATCTACAATCTGCATTAAAAATAAAAAGGGTTGGCAG^ 
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CTTGGAGTCAACTGAGTGTGQACTGAAACTTCCAAAAACT Majority 
I .1 I I 

10 20 30 40 

1 1 1 U 

1 CTTCGAGTCAACTGAOTGTGGACTGAAACTTCCAAAAACT gphkngl 815-1. 
1 CTTGGAGTCAACTGAGTGTGGACTGAAACTTCCAAAAACT gp7b-l. 
1 CTTGGAGTCAACTGAGTGTGGACTGAAACTTCCAAAAACT gp7c-l. 
1 CTTGGAGTCAACTCAGTGTGGACTGAAACTTCCAAAAACT gp7d-l. 

GACAT GAG GAGTCACTGGAGAATCATGATCAAGGAGCTAC KaioriCv 

1 -T 1 — T ' 

50 60 70 80 

— —J 1 1 1» 

41 GACATGAGGAGTCACTGGAGAATCATGATCAAGGAGCTAC gph)cngl815-l, 
41 GACATGAGGAGTCACTGGAOAATCATGATCAAGGAGCTAC gp7b-l. 
41 GACATGAGGAGTCACTGGAGAATCATGATCAAGGAGCTAC gp7c-l, 
41 GACATGAGGAGTCACTGGAGAATCATGATCAAGGAGCTAC gp7d-l. 

ACACTCTGACTTAACTTTATTCTGTGGACAATGAGAGACA Maioritv 
1 1 1 T ^ 

90 100 110 120 

1 1 I I. 

81 ACACTCTGACTTAACTTTATTCTGTGGACAATGAGAGACA gph)cngl815-l. 
81 ACACTCTGACTTAACTTTATTCTGTGGACAATGAOAGACA gp7b-l. 
81- ACACTCTGACTTAACTTTATTCTGTGGACAATGAGAOACA gp7c-l. 
81 ACACTCTGACTTAACTTTATTCTGTGGACAATGAGAGACA gp7d-l, 

ACTGCAAGGATTAACAGTGAGAACATGAAGCTGCCACTTT Majority 
( • I I 

130 140 ISO 160 

1 1 1 t. 

ACTGCAAGGATTAACAGTGAGAACATGAAQCTOCCACTTT gph)cngl815-l . 
ACTGCAAGGATTAACAGTGAGAACATGAAGCTGCCACTTT gp7b-l . 
ACTGCAAGGATTAACAGTGAGAACATGAAGCTGCCACTTT gp7c-l. 
121 ACTGCAAGGATTAACAGTGAGAACATGAAGCTGCCACTTT gp7d-l . 

TGATGTTTCCCGTGTGTC TGCTATGGTTGAAAGACTGTCA Majority 
P , , P 

170 ISO 190 200 

1 I I 1. 

161 TGATGTTTCCCGTGTGTCTGCTATOGTTOAAAQACTQTCA gphknglOlS-l . 
161 TaATOTTTCCCGTOrOTCTGCTAVGGTTGAAAGACTOTCA gp7b-l. 
161 TOATGTTTCCCOTGTGTCTGCTA^GGTTGAAAGACTGTCA gp7c-l. 
161 TQATGTTTCCCGTGTGTCTGCTATGGTTGAAAGACTGTCA gp7d-l . 

TTQTGCACC 9ACTTGGAA6GACAAAACTGCCATCAQTGAA Majority 
1 1 1 r 

210 220 230 240 

1 ^ I t. 

201 TTGTGCACCTACTTGaAAQGACAAAACTOCCATCAGTOAA gphknglBlS-l . 

201 TTGTGCACCTACTTGGAAGGACAAAACTQCCATCAGTGAA gp7b-l. 

201 TTOTGCACCTACTTOGAAGGACAAAACTGCCATCAGTGAA gp7c-l . 

201 TTOTGCACCTACTTGGAAGGACAAAACTGCCATCAGTGAA gp7d-l. 

AACGCGAACAGTT T T T C T G AGGCT6GGGAGATAGACGTAG Majority 

I 1 \ r 

250 260 270 280 

1 1 I L- 

241 AACGCQAACAGTTTTTCTGAGGCTGGGGAGATAGAC6TAG gphkngl815-l. 

241 AACGCGAACAGXTTTTCTGAGGCTOGGOAGATAGACOTAG gp7b-l. 

241 AACGCGAACAGTTTTTCTGAGGCTGGGGAGATAGACGTAG gp7c-l . 

241 AACGCGAACAOTTTTTCTGA6GCTGGGGA0ATAGACGTAG gp7d-l. 



121 
121 
121 
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281 
281 
281 
281 



321 
321 
321 
321 



ATGGAGAOGTGA AGATAGCTTTGATTGGCATTAAACAGAT Majority 

"0 300 310 320 

' ' ■ I 

ATCGAOACGTGAAGATAGCTTTGATTGGCATTAAACAGAT gphkaglSlS 

ATGGAGAOGTOAAGATAOCTTTGATTGGCATTAAACAGAT gp7b-l . 

ATGGAGACGTGAAGATAGCTTTGATTGGCATTAAACAGAT gp7c-l. 

ATGGA6ACGTGAAGATAGCTTTGATTGCCATTAAACAGAT gp7d-l [ 

OAAAATCATGA TGQAAAGGAQAGAGGAAGAACACAGCA A A Majority 

^3*> 340 350 360 

1 * ' ^ |_ 

GAAAATCATGATGGAAAGGAGAGAGGAAGAACACAGCAAA gphknglBlS 
GAAAATCATGATGGAAAGGAGAGAGGAAGAACACAGCAAA gp7b-l . 
GAAAATCATGATGGAAAGGAGAGAOGAAQAACACAGCAAA gp7c-l! 
OAAAATCATOATGGAAAGGAGAGAGGAAGAACACAGCAAA gp7d-l! 



TAATGA AAACCTTGAAGAAGTGCAAAGAAGAAAAGCAGG Majority 
370 380 390 400 

361 CTAATGAAAACCTTGAAGAAGTGCAAAOAA6AAAAGCAGO gphknglBlS- 

CTAATGAAAACCTTGAAOAAGTGCAAAGAAGAAAAGCAGG gp7b-l. 

CTAATGAAAACCTTGAAGAAGTGCAAAGAAGAAAAGCAGG gp7c-l! 

CTAATGAAAACCTTGAAGAAGTOCAAAGAAGAAAAGCAGO gp7d-l! 

AGGCC CTGAAACTTATGAATGAAGTTCATGAACACC TOGA Majority 
. * I I I 

420 430 440 

AGGCCCTGAAACTTATGAATGAAGTTCATGAACACCTGGA gpWmglBlS- 

AGGCCCTGAAACTTATGAATGAAOTTCATOAACACCTGGA sp7b-l. 

AGGCCCTGAAACTTATGAATOAAGTTCATOAACACCTGGA gp7c-l. 

ACGCCCTGAAACTTATGAATOAAOTTCATGAACACCTGGA gp7d-l. 

OGAQG AAGAAAQCTyATGCCACGyTT CTCTGGCAGATTCC Majority 

r- - — 1 1 r 

*50 460 470 4S0 

■■■ ' ' I ■ 

GOAOGAAGAAAQCTTATOCCAGGTTTCTCTOGCAGATTCC gphknglSlS- 
GOAGOAAGAAAGCTTATOCCAOGTTTCTCTGGCAGATTCC gp7b-l . 
OOAOOAAGAAAOCTTATOCCAGOTTTCTCTOGCAQATTCC gp7c-l. 
441 OOAOOAAOAAAOCTTATOCCAOOTTT CTCTGGCAGATTCC gp7d-l. 

TGGGA TOAATGCAGQOCTTGCCTQQAAAGTAA CTGCATGA Majority 
I r— , 1- 

*?0 500 510 520 

TQGGATGAATOCAGOGCTTGCCTQGAAAOTAACTGCATGA gphknglSlS- 
TGGGATGAATGCAGGGCTTGCCTGGAAAOTAACTGCATGA gp7b-l. 
TOGOATGAATGCAG6GCTTOCCTGGAAAGTAACTGCATQA gp7c-l, 
TQGGATOAATGCAGGGCTTGCCTGGAAAOTAACTGCATGA gp7d-l. 



'361 
361 
361 



401 
401 
401 
401 



441 
441 
441 



481 
481 
481 
481 



QGTTT GATACCACCTGCCAACCTGCATGGTC CTCTGTGAA Majority 
• ■ I 

530 540 550 560 

' « I t 

521 OGTTTGATACCACCTGCCAACCTGCATGGTCCTCTGTGAA gphJoiglBlS- 

S21 GOTTTGATACCACCTGCCAACCTOCATGGTCCTCTOTGAA 9p7b-l. 

OQTTTGATACCACCTGCCAACCTOCATGQTCCTCTOTGAA gp7c-l. 

GOTTTGATACCACCTOCCAACCTGCATGGTCCTCTGTOAA gp7d-l. 



521 
521 
521 
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AAATATGG 



561 
561 
561 
561 



— I — 

570 



600 



Majority 



580 590 

1 ' ' ' 

A A A T A T Q G|TGGAACAGTT TTTCAGGAAGATCTATCAGTTtI cphknglSlS-l 

AAATATGG , 

------- gp7D-i. 

AAATATGG , 

□ — — gp/c-1 . 
- - gp7d-l. 



601 
569 
569 
568 



— I — 

610 
J. 



AAAATGACAGAAGTGGCCCTGTCA Majority 



620 
JL 



630 
-L. 



T 
640 



ICTGTTTCCTCTCCAGgIa AAATCACAGAAGTGGCCCTGTCA gphtaglB15-l. 

AAAATGACAGAAGTGGCCCTGTCA gp7b-l. 

3p7c-l. 



gp7d-l. 



641 
593 
569 
56& 



681 
633 
569 
568 



GCAAAGGGGTCACTGAGGAAGATGCGCAGGTGTCAC A C A T Majority 
I - • I ' , — P 

650 660 670 680 

' 1 . 1 I 

GCAAAGGGGTCACTGAGGAAGATGCGCAGQTGTCACACAT gphknffl815-l. 
GCAAAGQGGTCACTGAGGAAGATGCGCAGGTOTCACACA T gp7b-l. 

1gp7c-l . 
gp7d-l. 



AGAGCATGTGTTCAGCCAGCTGAGCGCAGATGTGACATCT Majority 



690 

JL 



700 
-L, 



710 



720 



AGAOCATGTOTTCAGCCAGCTGAGCOCAGATGTGACATCT gphkngl815-l. 
AGAGCATGTGTTCAGCCAGCTGAGCGCAGATGTGACATCT gp7b-l. 

gp7c-l. 



gp7d-l. 



721 
673 
569 
568 



761 
713 
569 
S68 



801 
753 
569 
568 



CTCTTCAACAGAAGCCTTTACGTCTT C AAACAGCTGCGGC Majority 

r- 1 1 : r 

730 



740 
JL 



750 



760 
J. 



CTCTTCAACAGAAGCCTTTACGTCTTCAAACAGCTGCGGC gphJcngl815-l. 
CTCTTCAACAGAAGCCTTTACGTCTTCAAACAGCTGCQQC gp7b-l, 

gp7c-l . 



gp7d-l. 



GAGAATTTGACCAGOCTTTTCAQTCATATTTCACATCGGG Ka jorlty 
• ■ i I 



770 

JL 



780 



790 



800 

JL. 



G A OA ATTTOACCAGGCTTTTCAGTCATATTTCACATCGGO gph3aigl8l5-l- 
GACAATTTOACCAQOCTTTTCAGTCATATTTCACATCGQG gp7b-l. 

gp7c-l. 



gp7d-l. 



GACTQACGTTACAGAGCCTTTCTTTTTTCCATC T T T G T C C Majority 
• « I ' r 



810 
J. 



820 
J. 



830 



840 
J- 



GACTGACGTTACAGAQCCTTTCTTTTTTCCATCTTTGTCC gphtaigl81S-l. 
GACTQACGTTACAGAGCCTTTCTTTTTTCCATCTTTGTCC gp7b-l. 

gp7c-l. 
gp7d-l. 
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AAGGAGCCAQCCTACAGAQCAGAT GCTGAOCCAAGCTGGG Kaioritv 
1 " — — ■ r^^ — ■ — 1- 

850 860 870 880 

— 1 — JL_. I 

841 AA6QAGCCA0CCTACAGAGCAGATGCT6AGCCAA0CTGGO gphkngl815-l . 
793 A AC C A GCCAGCCTACAQAGCAGATGCTGAQCCAAGCrGGG cp7b-l. 



569 
568 



881 
833 
605 
602 



-]a G^ C CAGCCTACAGAGCAGA7GCTGAGCCAAGCTGGG gp7C"l, 



CAAGCrGGG ffp7b-l. 
CAAGCTGGG gp7c-l . 
CCAGCCTACAGAGCAGATGCTGAOCCAAGCTGGG gp7d-l, 

CCATTCCCAATGTCTTCCAGCTGCTCTGCAACTTG A G T T T Maioritv 
• I 1 T 

890 900 910 920 
t . i_ I ■ 

CCATTCCCAATGTCTTCCAGCTGCTCTGCAACTTGAGTTT gph)cngl8X5-l . 

CCATTCCCAATGTCTTCCAGCTGCTCTGCAACTTGAGTTT gp7b-l . 

CCATTCCCAATGTCTTCCAGCTGCTCTGCAACTTGAGTTT gp7c-l. 

CCATTCCCAATGTCTTCCAGCTGCTCTGCAACTTGAGTTT gp7d-l. 

CTCAOTTTATCAAAGTQTCAGTGAA AAACTCATCACAACC Uaioritv 
» i I I 

930 940 950 960 

i 1— 1 L. 

921 CTCAGTTTATCAAA6TOTCAGTQAAAAACTCATCACAACC gphknglSIS-l. 
873 CTCAOTTTATCAAAGTGTCAGTGAAAAACTCATCACAACC gp7b-l. 
645 CTCAGTTTATCAAAOTGTCAGTGAAAAACTCATCACAACC gp7c-l. 



642 



— — - - ---- V •w*v«w *%>nnnAMV* a\»aak*a\.AA\^V. vP'C-jL. 

CTCAGTTTATCAAAGTGTCAGTGAAAAACTCATCACAACC gp7d-l. 

CTQCGTGCCACAQAGGACCCTCCAAAA C A AGAC A A A G A C T Kaiorlty 
I I I I 

970 980 990 1000 

« 1 1 L. 

961 CTGCGTGCCACAOAGOACCCTCCAAAACAAGACAAAGACT gphkngl815*l . 
913 CTGCGTGCCACAGAGGACCCTCCAAAACAAGACAAAGACT gp7b-l. 
685 CTGCGTGCCACAGAGGACCCTCCAAAACAAGACAAAGACT gp7c-l. 
682 CTGCGTGCCACAGAOGACCCTCCAAAACAAGACAAAGACT gp7d-l. 

CCAACCAGGGAGGCCCGATT TCAAAGATACTACCTGAGCA Kaiority 

1 — — I —I r 

1010 1020 1030 1040 

* t t I 

1001 CCAACCA6GGAGGCCCGATTTCAAAGATACTACCTGA6CA gph)mgl815-l. 

953 CCAACCAGG G A GOCCCGATTTCAAAGATACTACCTGAGCA gp7b-l. 

725 CCAACCAGGGAGGCCCGATTTCAAAGATACTACCTGAOCA gp7c-l. 

722 CCAACCAGG GAG OCCCGATT TCAAAGATACTACCTGAGCA gp7d-l. 

AOACAOAGGCTCAGATQO GAAACTTGGCCAOAATTTOTCT Kajority 
1 : 1 1 r 

1050 1060 1070 1080 

1 1 I U 

1041 AOACAOAOQCTCAOATGOOAAACTTGGCCAOAATTTOTCT gphlcngl815-l. 
993 AOACAGAOGCTCAGATQOOAAACTTGQCCAGAATTTOTCT gp7b-l . 
765 AGACAGAOGCTCAOATGOGAAACTTGGCCAOAATTTGTCT gp7c-l . 
762 AOACAGAGGCTCAGATGOGAAACTTGGCCAOAATTTGTCT gp7d-l . 

QATTGCGTTAATTTTCGCAAGAGATGC C A O AAA T G C C A G G Kajority 
I 1 1 r 

1090 1100 1110 1120 

< 1 1 L, 

1081 OATTGCOTTAATTTTCGCAAGACATOCCAOAAATOCCAGG gphknglBlS-l. 
1033 GATTGCOTTAATTTTCGCAAGAOATGCCAOAAATGCCAQG gp7b-l. 
BOS GATTGCG7TAATTTTCGCAAGAOATOCCAOAAATGCCAGG gp7c-l. 
802 OATTGCOTTAATTTTCGCAAGAGATQCCAGAAATOCCAGG gp7d-l. 
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ATTATCTATCTGATGACTGCCC TAATGTGCCTOAACTATA Majority 

, ]^ USO 1160 

*»»ATCTATCTOATOACTGCCCTAATOTGCCTGAACTATA gphJcn«l815-l 
1073 ATTATCTATCTGATGACTGCCCTAATGTGCCTGAACTATA flp7b-l 
B4S ATTATCTATCTGATGACTGCCCTAATGTGCCTGAACTATA gp7c-l" 
842 ATTATCTATCTGATGACTGCCCTAATGTGCCTGAACTATA gp7d-l." 

CAGAGAACTCAATG AGGCCCTCCCACTGGTCAGTAGATCC Hajority 
. ]^ U80 1190 1200 

CAGAGAACTCAATGAGGCCCTCCGACTGGTCAGTAGATCC gphtaglSlS-l 

1113 CAGAGAACTCAATGAGGCCCTCCGACTGOTCAGTAGATCC gp7b-l 

885 CAOAGAACTCAATGAGGCCCTCCGACTOGTCAOTAOATCC «>7c-l' 

882 CAGA0AACTCAATGAGGCCCTCC6ACTGGTCAGTAGATCC gp7d-i; 

AATCAGCAATACGA CCAGGTGGTGCAGATGACCCAOTATC Majority 

".^0 1220 1230 1240 

' AATCAOCAATACGACCAOGTOOTOCAGAtOACCCAOTATC flphknglBlS-l 

1153 AATCAOCAATACGACCAOGTGOTOCAGATOACCCAOTATC 9p7b-l 

925 AATCAOCAATACGACCAOGTOOTOCAGATGACCCAGTAIC gp7c-l' 

922 AATCAGCAATACGACCAGGTGOTOCAGATGACCCAGTATC gp7d-l! 

ACCTCGAAGACACCACGCTTC TGATOOAOAAGATOAOAGA Majority 
^fSO 1260 1270 1280 

ACCTOGAAGACACCACOCTICTGATGGAGAAGATGAGAGA 9PhknglB15-l. 
1193 ACCTGGAAGACACCACGCTTCTGATGGAGAAGATGAGAGA gp7b-l 
965 ACCTGGAAGACACCACGCTTCTGATGGAGAAGATGAGAGA gp7c-l 
962 ACCTOGAAOACACCACGCTTCTGATGOAOAAGATGAGAGA gp7d-l. 

OCAGTTTCGCTGOGTTTCT OAACTGOCATACCAGTCCCCA Majority 

1300 1310 1320 

OCAGTITGGCTGGGTTTCTGAACTGGCATACCAOTCCCCA gphlcngl815-l. 

1233 OCAGTTTOOCTOGGTTTCTGAACTOGCATACCAOTCCCCA gp7b-l, 

1005 OCAOTTTOOCTGOGTTTCTOAACTOGCAIACCAGTCCCCA gp7c-l. 

lOOaOCAOTTTOOCTOGOTTTCTGAACTOOCATACCAGTCCCCA gp7d-l. 

OOAOCTGAOOACA TCTTTAATCCAOTGAAAQTAATOOTAG Majority 

"3° "10 1350 1360 

X321 00 A6CTGAOOACATCTTTAAICCAOTGAAAOTAATOOTAO flphkngl81S-l. 

1273 OGAGCTGAOGACATCTTTAATCCAOTOAAAOTAATOOTAO 9p7b-l. 

1045 OeAOCTOAGGACATCTTTAATCCAGTaAAAGTAATCGTAG gp7e-l. 

1042 GGAGCTGAGGACATCTTTAATCCAGTOAAA0TAATGOTAO gp7d-l. 

CCCTAAGTGCTCA TOAAOGAAATTCTTCTGATCAAOATGA Majority 

1380 1390 1400 

1361 CCCTAAOTGCTCATGAAGGAAATTCTTCTGATCAAGATGA gphtaiglBlS-l. 
1313 CCCTAAGTGCICATGAAGGAAATTCTTCTGATCAAGATGA gp7b-l. 
1085 CCCTAAGTGCTCATGAAOGAAATTCTTCTGATCAAOATGA gp7c-l. 
1082 CCCTAACTOCTCATOAAGQAAATTCTTCTGATCAAGATGA gp7d-l. 
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CACAGTGGTTCCTTCAAGCCTCCTGCCTTCCTCTAACTTC Majority 

1^10 1420 1430 1440 

* ■ ' I 

1401 CACAGTGGTTCCTTCAAGCCTCCTGCCTTCCTCTAACTTC gph)cngl815-l. 

1353 CACAGTGGTTCCTTCAAGCCTCCTGCCTTCCTCTAACTTC gp7b-l. 

1125 CACAGTGGTTCCTTCAAGCCTCCTGCCTTCCTCTAACTTC flp7c-l. 

1122 CACAGTGGTTCCTTCAAGCCTCCTGCCTTCCTCTAACTTC gp7d-l. 

ACACTCAGCAGCCCTCTTGAAAAGAGTGCTGGCAAC G C T A Majority 

1450 1460 1470 1480 

—J — 1 I ' 

1441 ACACTCAGCAOCCCTCTTGAAAAGAGTGCTCGCAACGCTA gphkaiglBlS-l . 
1393 ACACTCAGCAGCCCTCTTGAAAAGAGTGCTGGCAACGCTA gp7b-l . 
1165 ACACTCAGCA6CCCTC T T GAAAAGAGTGCTGGCAACGCTA gp7c-l. 
1162 ACACTCAGCAOCCCTCTTGAAAAGAGTGCTGGCAACGCTA gp7d-l . 

ACTTCATTGATCACGTQQTAGA GAAGGTTCTTCAGCACTT Haioritv 
I , , _ p 

1^90 1500 1510 1520 

* ' I I 

1481 ACTTCATTGATCACGTGGTAGAGAAGGTTCTTCAGCACTT gph)cngl815-l . 
1433 ACTTCATTGATCACGTGGTAGAGAAGGTTCTTCAGCACTT gp7b-l. 
1205 ACTTCATTGATCACGTGOTAGAGAAGGTTCTTCAGCACTT gp7c-l. 
1202 ACTTCATTGATCACGTGGTAGAGAAGOTrrCTTCAGCACTT gp7d-l . 

TAAQGAGCACTTTAAAACTTQGTAAGAAGATTTAGTCCAT Kajorlty 

* • I I 
1530 1540 1550 1S60 

* ' I I 

1521 TAAGGAOCACTTTAAAACTTGGTAAGAAGATTTAGTCCAT gphknglB15-l . 

1473 TAAGGAGCACTTTAAAACTTGGTAAGAAGATTTAGrrCCAT gp7b-l . 

1245 TAAGGAGCACTTTAAAACTTGGTAAGAAGATTTAGTCCAT gp7c-l . 

1242 TAAGGAOCACTTTAAAACTTGGTAAOAAGATTTAGTCCAT gp7d-l . 

CCTATAATCAGCAAGAATTACACCTTCGGCC AAGACCTGA Hajority 
I , , P 

1570 1580 1590 1600 

« I 1 I. 

1561 CCTATAATCAGCAAGAATTACACCTTCGQCCAAGACCTGA gphkngl815-l . 
1513 CCTATAATCACCAAGAATTACACCTTCGGCCAAGACCTGA gp7b-l . 
1285 CCTATAATCAGCAAGAATTACACCTTCGGCCAAGACCTGA gp7c-l . 
1282 CCTATAAT.CAOCAAOAATTACACCTTCGGCCAAGACCTGA gp7d-l . 

QAATTCTGAAAATACAAAGCAQGCTAACACAATGAACACA Hajority 
•ill 
1610 1620 1630 1640 

« 1 1 L. 

1601 OAATTCTGAAAATACAAAGCAGGCTAACACAATGAACACA gph]cngl815-l. 
1553 GAATTCY6AAAA9ACAAA6CAGGCTAACACAATGAACACA gp7b-l. 
1325 GAATTCTOAAAATACAAAGCAGGCTAACACAATGAACACA gp7c-l . 
1322 QAATTCTOAAAATACAAAGCAGGCTAACACAATGAACACA gp7d-l. 

GCTGCATGAAAGTTAGGTAT ATATTAGGAAGCACTArTGO Majority 
1 , , P 

1650 1660 1670 1680 

1 1 I t. 

1641 GCTGCATQAAAG7TAGOTATATATTAOGAAGCACTATTGG gphkngl8l5-l. 
1593 GCTGCATOAAAGTTAGGTATATATTAGGAAGCACTATTGG gp7b-l , 
1365 GCTGCATGAAAGTTAGGTATATATTAG6AAGCACTATTGO gp7c-l. 
1362 GCTGCATGAAAGTTAGGTATATATTAGGAAGCACTATTOQ gp7d-l. 



FIG. 14F 



wo 99/47535 PCTAJS99/05606 

49/57 



TTTACTTTGTTG AATGGAAGTTTAATAGCTATTCAAATTG Majority 

1^50 1700 1710 1720 

' « ' I 

1681 TTTACTTTGTTGAATGOAAGTTTAATAGCTATTCAAATTG gphknglSlS-l 

1633 TTTACTTTGTTGAATGGAAOTTTAATAGCTATTCAAATTG gp7b-l 

1405 TTTACTTTCTTGAATCCAAGTTTAATAGCTATTCAAATTG gp7c-l 

1402 TTTACTTTGTTGAATGGAAGTTTAATAGCTATTCAAATTG gp7d-l! 

AGTTAATATAAA AATTTCTTCCTAAAAAGTAAAATGTACA Majority 



1730 1740 1750 



1760 



1721 AGTTAATATAAAAATTTCTTCCTAAAAAGTAAAATGTACA gphJcnglBlS-l 

X673 AGTTAATATAAAAATTTCTTCCTAAAAAGTAAAATGTACA gp7b-l 

1445 AGTTAATATAAAAATTTCTTCCTAAAAAGTAAAATGTACA gp7c-l 

1442 AGTTAATATAAA AATTTCTTCCTAAAAAGTAAAATGTACA gp7d-l! 

TATGTAGAATA TGATQCATTAGTTCTTTGTATACTAAATA Majority 

' 1780 1790 .1800 

1761 TATGTAGAATATGATGCATTAGTTCTTTGTATACTAAATA gph)mgl8X5-l 
1713 TATGTAGAATATGATGCATTAGTTCTTTGTATACTAAATA gp7b-l 
1485 TATGTAGAATATGATGCATTAGTTCTTTGTATACTAAATA gp7c-l 
1482 TATGTAGAATATOATGCATTAGTTCTTTGTATACTAAATA gp7d.l! 

AATACTG AGTCCCCT Majority 



1810 



1801 AATACTGAGTCCCCT 
1753 AATACTGAGTCCCCT 
1525 AATACTGAGTCCCCT 
1522 AATACTGAGTCCCCT 



9phkngl815-l. 
gp7b-l. 
flp7c-l . 
9P7d-l. 
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bhkngl 
bh)cn92 
bhkng3 

bhkngX 
bh)cng2 
bh)cng3 

bhkngl 
bhkng3 
bhknga 

bhlcngl 
hhkiig2 
bh]czig3 

bhlcngl 
bhkng2 
bhkng3 



bhkngl 
bhkng2 
bhJcngB 

bhkngl 
bhkng2 
bh)cng3 

bhkngl 
bh)cng2 
bhkng3 

bhkngl. 
bhkng2 
bh)cng3 



bhkngl 
bhkng2 
bhkngs 

bhkngl 
bhkng2 
bhkng3 



80 

GCAACCTCGrroGTGAGAGCCrcaCTTAGTGTC^ 

GTGAAGCmrCTTACA^ 

C ACTCTTC^iGli iA TTTHCTATtTOCTCC^ 
CAATCTTGCnGTTTATCCmJTATCTG™^ 

^TGAAAATCCltaiTCGAAW^^^ 

AGCACGAGGCCCTOA^ 
^TTCCTGGCSACaAATGCJ^ 

TMGAAATCCACCaTT^^ 

^^™™CrraAGAAGTTCftCTGAGGA^ 
GTGGMTTCT^TAACMGA^ 

(nt»(UTTTCTCTAXWU3m5A(rcr^^ 
721 

M»CTTGGGACATTCCa«5CT^^ 
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881 



bhkngl 
bhkng2 
bh)cng3 

bh)cngl 
bhkng3 
bhkng3 

bhkngl 
bhkngS 
bhkng3 

bhkngl 
bhkng2 
bhkng3 



960 



ACAGAGATGCTCAAGGCCATTGAOyunTATCCAA^ 

ACftGAGATGCTGAAGGCCATTGAGGACTTATCCAAACRAGA 

ACAGAGATGCTGAACGCCATTGAGGACTTATCCAAACAAGACAAAGATT^^ 
961 

1040 

GCCI\i'rGCGCXWCAGAGGGCTCT(rrGGAGAACCTC«5CC^ 

1041 ^^^^ 

AATCTCAGGATTACXTATGGGCAGACTGCCCTOCTC^ 

AATGTCAGGATTACCTATGGGCftflACltKXCTG^ 

AATCTCAGGaiTTACCTAlGGaCAGACTO 

AACATATCCAATCAGCAGTAKXXCRGGTACTCCWa 

AACATATCCaVATXaCCACTATGCCCAGCTAC^^ 

AACATATCCAATCACX3CTATGCCC3UOTACTCCA^ 



1201 



bhkngl 
bhkng2 
bhkng3 

bhkngl 
bhkn92 
bhkngS 

bh)cngl 
bhkng2 
bhkng3 



1280 



GATCaGAGAGCA GXllU inWCnAACAGAGCTGGC^ 
^"^^^'^^^^"^^GCAGTTT^^ 

CaTGAGAGAGGAGTTTanTKKJTAACAGAGCro^^ 

' 1360 

^^^^Cea GGTtnT CAaSAAGGaUATTTCTCC^^ 
'''^^Q^^rCCAGCTOITCACGAAGGawmTCT^^ 

1440 

TTCACACTCACC AIXrCTCTTGA AGAAaGTGCTGJ^^ 

"CACaCTCACCATCCCTCTTGAAGAAAGTGCTG^ 

™3U»CTCACCATCrCTCTTC3AAGAA« 



1441 



bhkngl 
bhkng2 
bhkng3 

bhkngl 
bhkng2 
bhkng3 

bhkngl 
bhkng2 
bhkngS 

bhkngl 
bhkng2 
bhkng3 



1520 



TITrA ftGGAAC aTTTTA A ATClTG GTAA(XaflAGT 
TTTTA AGGAAC Ai iA-i.^ iAArcnwrAAGCftflaQXAri^^ ll UCUta TAIXaUlIAGAIGGTTCTraAAA 
:i.j.iiAAS3AACArTTrAAATCTTOOTAAaCAGA^ 

ICOO 

<3WOAAW»TOftCAAAACtftGCTTTTOAai^^ 

1680 

' 1760 

^^T0 TG ATTtXX!>tf3RAACAAACT^ 

ATAyGTaATTOXaGAAAaVAA CIt^l'm 

ATATGTGATTCCCAGAAAC3UU«rTGCnTTTGTCTT^^ 
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1761 



2161 



2401 



bhkngl GTTGGCAGG 

2561 

bhkagi 2640 

bhkngl 2720 

bhkngl 2800 

2801 

bhkngl 2880 

2881 

bhkngl 2960 

bhSa ^^''^^f^^^^^^ 

2961 • 

bhkngl 

bhtag2 ClCTGAATOTTCGAOAarTOT^^ 

bhlcng3 ». - ..^ .« , . 
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- - 81 — " ''"*'^''*^a'iJaja><MCAPTwa)KTAisau^ 

9pl*ngieX5_-a_ J|^«HKHREQ««VSKLM«KSKa^r™^^ 

fefflhkng_aa wvwcaLQHPKBHPRrw 

bhkngl_aa YHLAKAVQHPKBHPKSW 
gphkn9l815_aa_ HWBXVLQKPKEHPKTW 
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matureHKNG - - _ -APTWKDKTAIS 

I^wYVpPtL-, MRTWDYSNSGmCKPPLLVFIVO^WLICDSHCAPTOKDKTAIS 

t^*Sri««, MKPPLLVPIVCLLWLKDSHCAPTWKDKTAIS 

KKKGl-IPFl MKIKAEKNEGPSRSWWQLHWGDIANNSC5NMKPPIJ.WrVa:AWI^ 

2S5ilf®JI!^?««, ENUCSFSEWEIDADEEVTOCALTGIKQMKIMMERKEKEHTt^ 

S^wY tL^ ?^psevgeidadeevkkaltgikqmkimmeiucek^^ 

^S^^izYJ"^^^^ EOTJCSPSEVGEIDADEEVKKALTGIKQMKIMMERKEKEMT^ 
HKNGl - IPFl ENLKSFSEVGEICA^ 

P^EHI'2^^LCRESIJU)SWECRSCLENNCMRIYTT^ 
I^wY^ri^J^ ^ NEVQEHLEEEERLCRESIJUJSWGECRSCLENNCra 

HKNOX-IPFl NBVQEHLEEEERLC^ 

naUireHKNG ^^^NEKDLPISEKLIEEDAQL1X3MEDVFSQLT^^ 

^J7Y\"«^n«n EF^'^^NEKDLPISEKLIEEDAQLTQMEDVFSQLTVn^ 

S^yi;^^'^^^^ ^^J^PJ^NEKDLPISEKLIEEDAQLTQKEDVFSQLTTOTO 

HKMGl-IPFl f^PFHmNEKDLPlSEXLI^^ 

l^^rJi^^^^ TFQSHFISDTDLTEPYFFPAPSKEPMTKADLECKrWDIPNPPQLFCNFSVSIYE^ 
J^J^i:^^"^^^ •^SHPISDTOLTEPYPFPAFSKEPMTKADLEQCWDIPNFFQLFCNP^^ 
HKNGl-IPFl TFQSHFISDTDLTEPYFFPAFSKEPMTKADL^^ 

StSif^I^S««. KMI-KMEDLPKQDKAPDHGGLISKMLPGQDRGI^Et^ 
S^wY^r.?^n»^ ^^AIEDLPKQ0KAPDHGGLISKm.PGQl)RGIX:GELDQN^ 
laiLKAIEDLPKQDKAPDHTCLISKMLPCSQraGl^m^^ 
HKKGl-IPFl IMLKMmLPKQDKMDHGGLl 

25Sif®!H^««, S^VPAIOTELDEAIllLVWSNQQYGQII^MTRKHLEDTA^ 

^^yil^'^^^^ ££™yPAIJrrEIJ>EAIM,VNVSNQQTCQri^MTRK^^ 
HKNGl-IPFl OCPDVPALHT^ 

25SSf®S^™, AraraiiraSICVVPRIHBCmSKOTETMMT^ 
SS2wY^r,P?L^ APETElIIllSIQVVPRIHECOTSKQDEXMima^lLPSSHP^^ 

J A Pgm iFMSIQVVPRIHEGHISKQDElMMTDI^ILP 
HKKGI-IPPl APETEItPNSIQNn^PRIHBCanSKQDETMUTDLSILPSSMFTUa^ 

matureHKKG VAKALQHFKEHFKTW 
HKNGl -VI -IPP3 VAKALQHFKEHFKTW 
HKIIG1/X-V1-IPF2 VAKAliQHFKBHPKTW 
HKKGl-IPFl VAKALQHFKEHFKTW 
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NI liKPGK KI *C 
AAC ATT TTA AAA CCT GGT AAG AAG ATC TAA TCC 

5SSGTWKS*HK 
TCP TCA TCT GGG ACC TGG AAA TCC TGA AAT AAA 

RKVC^LYTMKY 
AGG AAA CTA TGT TAG CTA TAT ACT ATG AAO TAC 

SY*YSN*VKMK 
AGC TAT TAA TAC TCA AAT TGA GTT AAA ATQ AAA 

MYYISHYISSS 
ATG TAT TAC ATT TCA TGG TAC ATT AGT AGT TCT 

L 
CTA 



ILYPVSRII 4S9 
ATC CTA TAT CCA GTA AGT AGA ATT ATC li77 

KG*CNKHSC 479 
AAA GGA TAA TCC AAT AAA CAC AGT TGC 1437 

S*FTYVEWL 499 
TCT TAG TTT ACT TAT GTT GAA TGG CTT 1497 

I P P^KIKRN 519 
ATT CCT CCT TAA AAA ATC AAA CCT AAT 1SS7 
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TTG TAT ATT GAA TAA ATA CTA AAT CAC 1617 
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